WO 03/089572 (PCT/US02/35518)



ESSENTIAL AND IMPORTANT GENES OF PSEUDOMONAS AERUGINOSA
AND THE USE THEREOF TO DESIGN OR IDENTIFY ANTIBACTERIAL
AGENTS

Cross Reference to Related Applications

[0001] This application relates to U.S. Provisional Serial No. 60/372,095, filed on
April 15, 2002, which is incorporated in its entirety by reference herein.

Field of Invention 

[0002] The present invention relates to the identification of essential and important
genes in Pseudomonas aeruginosa, and the use thereof in screening assays and
diagnostic methods to identify, evaluate or design antibacterial agents useful for the
treatment of Pseudomonas infections. Such agents are particularly useful in preventing
and treating opportunistic infections in immunocompromised individuals and for
treating and preventing pulmonary infections in patients having cystic fibrosis disease.
Also disclosed is a Bayesian statistical model that may be utilized to increase the
statistical confidence that any given gene identified using the disclosed methodology is
essential.

Background of Invention 

[0003] Pseudomonas aeruginosa is a versatile Gram-negative bacterium that is able
to adapt to and thrive in many ecological niches, from water and soil to plant and
animal tissues. The bacterium is capable of utilizing a wide range of organic
compounds as food sources, thus giving it an exceptional ability to colonize ecological
niches where nutrients are limited, such as soil, marshes and coastal marine habitats.
Hardalo, C. & Edberg, S. C. Pseudomonas aeruginosa: assessment of risk from
drinking water. Crit. Rev. Microbiol. 23, 47-75 (1997). It also forms biofilms on wet
surfaces such as those of rocks and soil. Costerton, J. W., Stewart, P. S. & Greenberg,
E. P. Bacterial biofilms: a common cause of persistent infections. Science 284, 1318-
1322 (1999). Ahearn, D. G., Borazjani, R. N., Simmons, R. B. & Gabriel, M. M.
Primary adhesion of Pseudomonas aeruginosa to inanimate surfaces including
biomaterials. Methods Enzymol. 310, 551-557 (1999). Analysis of the P. aeruginosa
genome has identified genes involved in locomotion, attachment, transport and
utilization of nutrients, antibiotic efflux, and two component and other regulatory
systems involved in sensing and responding to environmental changes. Because its
natural habitat is the soil, where it is exposed to bacilli, actinomycetes and molds, it has
developed resistance to a variety of their naturally-occurring antibiotics.

[0004] The emergence of P. aeruginosa as a major opportunistic human pathogen
during the past century may be a consequence of its resistance to the antibiotics and
disinfectants that eliminate other environmental bacteria. P. aeruginosa is now a
significant source of bacteraemia in burn victims, urinary-tract infections in
catheterized patients, and hospital-acquired pneumonia in patients on respirators.
Bodey, G. P., Bolivar, R., Fainstein, V. & Jadeja, L. Infections caused by Pseudomonas
aeruginosa. Rev. Infect. Dis. 5, 279-313 (1983). It is also the predominant cause of
morbidity and mortality in cystic fibrosis patients, whose abnormal airway epithelia
allow long-term colonization of the lungs by P. aeruginosa. Thus, people with cystic
fibrosis, burn victims, individuals with cancer and AIDS, and patients requiring
extensive stays in intensive care units are particularly at risk of disease resulting from
P. aeruginosa infection. P. aeruginosa is also a cause of a variety of different
disorders including septicemia, urinary tract infections, pneumonia and chronic lung
infections, endocarditis, dermatitis, osteochondritis, ear and eye infections, bone and
joint infections, gastrointestinal infections and skin and soft tissue infections, including
wound infections, pyoderma and dermatitis.

[0005] Cystic fibrosis is one of the most common fatal genetic disorders in the United 
States, affecting about 30,000 individuals. A comparable number of people in Europe 
also have CF. It is most prevalent in the Caucasian population, occurring in one of 
every 3,300 live births. The gene involved in cystic fibrosis was identified in 1989 and 
codes for a protein called the cystic fibrosis transmembrane conductance regulator 
(CFTR). This protein, normally produced in a number of tissues throughout the body, 
regulates the movement of salt and water in and out of these cells. One hallmark of CF
is the presence of a thick mucus secretion that clogs the bronchial tubes in the lungs and
plugs the exit passages from pancreas and intestines, leading to loss of function of these
organs and resulting in a predisposition toward chronic bacterial infections.
Pseudomonas aeruginosa, having a propensity to live in warm, wet environments, is a
particular problem for CF patients, whose lungs typically become colonized (inhabited
long-term) by P. aeruginosa before their 10th birthday. Although antibiotics can
decrease the frequency and duration of these attacks, resistant bacteria are quick to
develop and the bacteria are never completely eradicated from the lung. More effective
antibiotics are necessary for improving lung function and quality of life for CF patients
for extended time periods.

[0006] Pseudomonas aeruginosa is notorious for its resistance to antibiotics and is,
therefore, a particularly dangerous and dreaded pathogen. Todor, K. 2000
Pseudomonas aeruginosa. University of Wisconsin-Madison,
http://www.bact.wisc.edu/microtextbook/disease/pseudomonas.html, available on
April 25, 2001. The permeability barrier afforded by its outer membrane LPS also
contributes to its natural antibiotic resistance, as does the presence of two antibiotic
resistance plasmids, both R-factors and RTFs, which are commonly transferred
between cells by the bacterial processes of transduction and conjugation. Only a few
antibiotics are effective against Pseudomonas, including tobramycin (TOBI; Chiron),
fluoroquinolone, gentamicin and imipenem, and even these antibiotics are not effective
against all strains.

[0007] Pseudomonas aeruginosa disease generally begins with some alteration or
circumvention of normal host defenses and may involve several different virulence
determinants. Todor, 2000, supra. The ultimate Pseudomonas infection may be seen
as composed of three distinct stages: (1) bacterial attachment and colonization; (2) local
invasion; (3) disseminated systemic disease. Particular bacterial determinants of
virulence mediate each of these stages and are ultimately responsible for the
characteristic syndromes that accompany the disease. For instance, Pseudomonas
utilize fimbriae or pili to adhere to the epithelial cells, apparently via binding to specific
galactose or mannose or sialic acid receptors on epithelial cells. Fimbrial adherence
may be an important step in Pseudomonas keratitis and urinary tract infections, as well
as infections of the respiratory tract. Mucoid strains, which produce an
exopolysaccharide (alginate), have an additional or alternative adhesin which attaches to
the tracheobronchial mucin (N-acetylglucosamine). Therefore, mucoid strains of P.
aeruginosa are commonly seen in lung infections.

[0008] The ability of P. aeruginosa to invade tissues depends upon its resistance to 
phagocytosis and the host immune defenses, and the extracellular enzymes and toxins 
that break down physical barriers and otherwise contribute to bacterial invasion. 
Todor, 2000, supra. For instance, Pseudomonas elastase cleaves collagen, IgG, IgA,
and complement, and also lyses fibronectin to expose receptors for bacterial attachment
on the mucosa of the lung. Alkaline protease interferes with fibrin formation and lyses
fibrin. Together, elastase and alkaline protease destroy the ground substance of the
cornea and other supporting structures composed of fibrin and elastin. Elastase and
alkaline protease together are also reported to cause the inactivation of gamma
interferon (IFN) and Tumor Necrosis Factor (TNF).

[0009] P. aeruginosa produces three other soluble proteins involved in invasion,
including a cytotoxin (MW 25,000) and two hemolysins. Todor, 2000, supra. The 
cytotoxin is a pore-forming protein originally named leukocidin because of its effect on 
neutrophils, but it appears to be cytotoxic for most eukaryotic cells. Of the two 
hemolysins, one is a phospholipase and the other is a lecithinase. They appear to act 
synergistically to break down lipids and lecithin. The cytotoxin and hemolysins 
contribute to invasion through their cytotoxic effects on eukaryotic cells. 

[0010] Pseudomonas aeruginosa also produces two extracellular protein toxins, 
Exoenzyme S and Exotoxin A. Exoenzyme S may act to impair the function of 
phagocytic cells in the bloodstream and internal organs to prepare for invasion by P.
aeruginosa, and is typically produced by bacteria growing in burned tissue. Exotoxin
A is partially identical to diphtheria toxin, and exhibits a necrotizing activity at the site
of bacterial colonization and is thereby thought to contribute to the colonization
process. Indirect evidence involving the role of exotoxin A in disease is seen in the
increased chance of survival in patients with Pseudomonas septicemia that is correlated 
with the titer of anti-exotoxin A antibodies in the serum. 

[0011] While therapeutic measures aimed at any of the above virulence factors may
help to slow the progression of an infection and may be useful in combined therapeutic
regimens, given the variety of virulence factors of P. aeruginosa, antibacterial agents
that inhibit growing bacteria by interacting with essential genes and essential gene
products are necessary. This is not to say that genes encoding virulence
factors would not be essential to survival in particular niches or environments,
which emphasizes the importance of screening for gene essentiality in various pathogenic
environments. See, e.g., Coulter et al., 1998, Staphylococcus aureus genetic loci
impacting growth and survival in multiple infection environments, Mol. Microbiol. 
30(2): 393-404. However, as P. aeruginosa becomes more and more resistant to 
existing antibacterial agents, new compounds are required. 

[0012] Indeed, reports of bacterial strains resistant to the most powerful known 
antibiotics are becoming more common, signaling that new antibiotics are needed for 
all bacteria, not only P. aeruginosa. For instance, the United States Center for Disease 
Control recently announced that one of the most powerful known antibiotics,
vancomycin, was unable to treat an infection of Staphylococcus aureus (staph), an
organism commonly found in the environment and responsible for many nosocomial
infections. If this trend continues, some have warned that we could return to a time
when a common bacterial infection is a life threatening matter. See Zyskind et al., WO
00/44906, published August 3, 2000.

[0013] Historically, however, the identification of new antibacterial drugs has been 
painstaking and laborious with no guarantee of success. Traditional methods involve
blindly and randomly testing potential drug candidate molecules, with the hopes that 
one might be effective. Today, the average cost to discover and develop a new drug is 
nearly $500 million, and the average time is 15 years from laboratory to patient. New 
identification and screening methods that shorten and improve this process are much 
needed. 

[0014] A newly emerging technique for identifying new antibacterial agents is to first 
identify gene sequences and proteins required for the proliferation of bacteria, or 
"essential" genes and proteins, and then conduct a biochemical and structural analysis 
of that target gene or protein in order to derive compounds that interact with the target. 
Such methodology employs molecular modeling techniques, combinatorial chemistry
and other means to design candidate drugs, and offers a more directed alternative to
merely screening random compounds with the hope that one might be suitable for a 
particular bacterium. 

[0015] Nevertheless, even this preferred approach presents obstacles including the 
identification of essential genes and proteins, and the design of new assays for the 
genes thus identified in order to efficiently screen candidate compounds. Several
groups have proposed systems for the identification of essential genes. For instance,
Zyskind and colleagues propose a method of identifying essential genes in Escherichia
coli by subcloning a library of E. coli nucleic acid sequences into an inducible
expression vector, introducing the vectors into a population of E. coli cells, isolating
those vectors that, upon activation and expression, negatively impact the growth of the
E. coli cell, and characterizing the nucleic acid sequences and open reading frames
contained on the subclones identified. See WO 00/44906, herein incorporated by
reference. The disadvantage of this method is that the overexpression of nonessential
genes can also negatively impact the cell, particularly the overexpression of membrane
proteins and sugar transport proteins that are not necessary for growth where alternative
carbon sources exist. Such proteins typically become trapped in membrane export
systems when the cell is overloaded, and would be identified by this methodology. See
Muller, FEMS Microbiol. Lett. 1999 Jul 1;176(1):219-27.

[0016] Another group proposes the identification of growth conditional mutants, and
more specifically temperature sensitive (ts) mutants, as a means to identify essential
genes in Staphylococcus aureus. See Benton et al., U.S. Patent 6,037,123, issued
March 14, 2000, herein incorporated by reference. Each gene is identified by isolating
recombinant bacteria derived from growth conditional mutant strains, i.e., following
introduction of a vector containing a library of nucleic acid sequences, which would
grow under non-permissive conditions but which were not revertants. These
recombinant bacteria were found to contain DNA inserts that encoded wild type gene
products that replaced the function of the mutated gene under non-permissive growth
conditions. By this method, Benton and colleagues were able to identify 38 loci on the 
S. aureus chromosome, each consisting of at least one essential gene.

[0017] The disadvantages of this method are first, the chemical employed to induce
mutagenesis (diethyl sulfate, DES) is capable of causing several mutations in the same
cell, thereby complicating interpretation of the results. Second, the method is
particularly labor intensive in that one must painstakingly analyze replica plates of 
individual colonies grown at permissive and non-permissive temperatures, where 
replica plates include both mutant and non-mutant cells. Thus, employing the 
appropriate level of mutagen to achieve a balance between minimizing the number of 
non-mutant colonies one must screen in order to identify one mutant, while at the same 
time avoiding multiple mutations in the same cell, may be an arduous task. 

[0018] Another group has proposed a transposon mutagenesis system for identifying
essential genes called "GAMBIT" ("genomic analysis and mapping by in vitro
transposition"), and has used the system to identify essential genes first in
Haemophilus influenzae (Gram-negative) and Streptococcus pneumoniae (Gram-positive),
and more recently in Pseudomonas aeruginosa. See Akerley et al., Systematic
identification of essential genes by in vitro mariner mutagenesis, Proc. Natl. Acad. Sci.
USA 95(15): 8927-32; Wong and Mekalanos, 2000, Proc. Natl. Acad. Sci. USA 97(18):
10191-96; and Mekalanos et al., U.S. Patent No. 6,207,384, issued March 27, 2001, herein
incorporated by reference. GAMBIT involves first isolating and purifying specific
genomic segments of approximately 10 kilobases using extended-length PCR, and
creating a high density transposon insertion map of the isolated region using Himar1
transposon mutagenesis. The transposon insertions are then transferred to the
chromosome following transformation of the bacteria with the transposon containing
vectors, and selection for the antibiotic resistance marker on the transposon. The
position of each transposon insertion with respect to a given PCR primer is then
determined by genetic footprinting, i.e., by amplifying sub-PCR products using one of
the original PCR primers and a primer that recognizes an internal site in the Himar1
transposon. By analyzing the length of PCR fragments thus identified, it is possible to
identify regions that are devoid of transposon insertions, thereby signaling regions that
might contain essential genes.
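
The footprinting logic described above can be sketched in a few lines of Python. This is only an illustration, not the GAMBIT authors' implementation: the function name insertion_free_gaps, the example fragment lengths and the 1,000 bp gap threshold are all assumptions. Each PCR product length is treated as the distance from the fixed primer to one transposon insertion, and sorting those distances exposes stretches that carry no insertion.

```python
def insertion_free_gaps(fragment_lengths, region_length, min_gap=1000):
    """Return (start, end) intervals of at least min_gap bp with no insertion.

    fragment_lengths: distances in bp from the fixed PCR primer to each
    transposon insertion, as inferred from product sizes (hypothetical input).
    region_length: length in bp of the amplified genomic segment.
    """
    # The region boundaries act as sentinels so gaps at either edge are reported.
    positions = [0] + sorted(set(fragment_lengths)) + [region_length]
    gaps = []
    for left, right in zip(positions, positions[1:]):
        if right - left >= min_gap:
            gaps.append((left, right))
    return gaps


# Hypothetical 10 kb segment: insertions at both ends, none in the middle.
example_lengths = [150, 420, 900, 1300, 1750, 5200, 5600,
                   6100, 6900, 7400, 8300, 9100, 9800]
gaps = insertion_free_gaps(example_lengths, region_length=10_000)
print(gaps)
```

A gap found this way is only a candidate: a region may lack insertions by chance, which is why the threshold should be chosen with the insertion density in mind.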



[0019] While the GAMBIT method is a good technique for looking at a small region
of the genome for essential genes, it would be extremely labor intensive to use this
method for analyzing the entire genome. This is particularly true for P. aeruginosa,
whose genome (~6 megabases) is more than three times the size of the H. influenzae
genome (~1.8 megabases). Furthermore, GAMBIT would not be readily applicable to
use in organisms that are less recombinogenic than H. influenzae. Indeed, while the H.
influenzae genome contains about 1700 protein coding genes, P. aeruginosa contains
about 5570. According to U.S. Patent 6,207,384, one would need to clone and
mutagenize the 6 million base pair genome of P. aeruginosa in 10,000 base pair
fragments, isolating and characterizing 400-800 mutants per 10,000 base pair fragment.
Generating 6X10^ mutants and characterizing them via PCR on gels would require a
significant investment of labor, materials and time.
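
The scale of that effort follows directly from the figures quoted in the paragraph. The short Python calculation below uses only those figures (a 6,000,000 bp genome, 10,000 bp fragments, 400-800 mutants per fragment) and assumes, for simplicity, that the fragments do not overlap; the variable names are illustrative.

```python
GENOME_BP = 6_000_000              # approximate P. aeruginosa genome size
FRAGMENT_BP = 10_000               # segment size amplified per GAMBIT round
MUTANTS_PER_FRAGMENT = (400, 800)  # range of mutants characterized per fragment

# Number of non-overlapping fragments needed to tile the genome.
fragments = GENOME_BP // FRAGMENT_BP

# Total mutants to isolate and characterize at the low and high ends.
low_total = fragments * MUTANTS_PER_FRAGMENT[0]
high_total = fragments * MUTANTS_PER_FRAGMENT[1]

print(f"{fragments} fragments, {low_total:,} to {high_total:,} mutants")
```

Even at the low end this amounts to several hundred thousand mutants, each requiring its own footprinting analysis.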

[0020] Another group at Abbott Laboratories has proposed a genome scanning
method for identification of putative essential genes in H. influenzae, whereby random
transposon insertions are mapped and analyzed to identify open reading frames
containing no insertion in order to identify putative essential genes. Reich et al., 1999,
Genome Scanning in Haemophilus influenzae for Identification of Essential Genes, J.
Bacteriol. 181(16): 4961-68. However, even though transposon insertions were
isolated that spanned the whole genome, the authors employed a genomic footprinting
technique similar to that used in GAMBIT to map insertions in a short contiguous
region of the chromosome. The method further employs the methods of mutation
exclusion and zero time analysis in order to monitor the fate of individual insertions
after transformation in growing culture, which looks at individual insertions on a
case-by-case basis. Again, such techniques would be extremely labor-intensive for the P.
aeruginosa genome, which is more than three times the size of the genome of H. influenzae.

[0021] Wong and Mekalanos also proposed identifying essential genes in P.
aeruginosa by starting with the knowledge of three essential genes in H. influenzae and
using genetic footprint analysis to determine if the homologues of these genes are
essential in P. aeruginosa. Of three homologues tested, only one was unable to
accommodate a transposon insertion. See Wong and Mekalanos, supra. Such results
underscore the fact that a gene that is shown to be essential in one species will not
necessarily be essential in another, given that some gene products may fulfill different
functional roles in different species. Furthermore, given the larger coding capacity of
the P. aeruginosa genome relative to that of other bacteria, it would not be surprising
for P. aeruginosa to possess an increase in redundant gene functions, thereby
decreasing the actual number of essential genes, and making them more difficult to
identify.

[0022] Another method is entitled Transposon Mediated Differential Hybridisation
(TMDH), which is disclosed in WO 01/07651, herein incorporated by reference. This 
method entails (i) providing a library of transposon mutants of the target organism; (ii) 
isolating polynucleotide sequences from the library which flank inserted transposons; 
(iii) hybridising said polynucleotide sequences with a polynucleotide library from said 
organism; and (iv) identifying a polynucleotide in the polynucleotide library to which 
said polynucleotide sequences do not hybridise in order to identify an essential gene of 
the organism. However, the problem with this methodology is that it has a high 
propensity to lead to false positives, and many essential genes will be missed. 
Furthermore, the method does not yield any detailed information regarding the loci 
disrupted by transposons, or whether they were hit more than once.

[0023] Thus, there is a great need for more efficient methods to identify essential 
genes, particularly in P. aeruginosa, so that new antibacterial agents may be designed
therefrom for use in treatment of P. aeruginosa infections. 

Summary of Invention 

[0024] The present inventors have devised a database of potential essential or 
otherwise important genes in P. aeruginosa, which may be used to verify essentiality
and design antibacterial agents active against the targets thus identified. In particular,
the inventors have isolated and mapped a library of at least about 5,000 to at least about
14,000 transposon insertions in the genome of P. aeruginosa, and more preferably a
library of at least about 8000 to at least about 14,000 transposon insertions, and even 
more preferably a library of at least about 10,000 to at least about 14,000 transposon 
insertions, using the recently published P. aeruginosa gene sequence. The map thus 
generated was used to form a database of approximately 1500 to 3000 open reading 
frames, or more preferably about 1500 to 2000 open reading frames, for which no 
transposon insertions could be obtained, each of which possibly represents an essential
gene required for growth and proliferation of P. aeruginosa on rich media, or an
important gene, the mutation of which results in an attenuated growth mutant. Also
disclosed is a Bayesian statistical model that may be utilized to increase the statistical
confidence that any given gene identified using the disclosed methodology is essential. 
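
As an illustration of how a "no-hit" list can be derived from a mapped insertion library, the following Python sketch flags open reading frames that contain no insertion. It is not the inventors' pipeline: the function name no_hit_orfs, the coordinate convention (1-based, inclusive) and the toy ORF names and positions are assumptions made for the example.

```python
from bisect import bisect_left


def no_hit_orfs(orfs, insertion_positions):
    """Return names of ORFs that contain no mapped transposon insertion.

    orfs: dict mapping ORF name -> (start, end), 1-based inclusive coordinates.
    insertion_positions: iterable of genome coordinates of mapped insertions.
    """
    sorted_hits = sorted(insertion_positions)
    missed = []
    for name, (start, end) in orfs.items():
        # Index of the first insertion at or after the ORF start.
        i = bisect_left(sorted_hits, start)
        # The ORF is hit only if that insertion also lies at or before its end.
        is_hit = i < len(sorted_hits) and sorted_hits[i] <= end
        if not is_hit:
            missed.append(name)
    return missed


# Toy data (hypothetical): orfA is hit once; orfB and orfC are missed.
toy_orfs = {"orfA": (100, 1300), "orfB": (1500, 1800), "orfC": (2500, 4000)}
toy_hits = [250, 1310, 1900, 2499, 4001]
candidates = no_hit_orfs(toy_orfs, toy_hits)
print(candidates)
```

Sorting the insertions once and using a binary search per ORF keeps the lookup fast even for many thousands of insertions and several thousand ORFs.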

[0025] Thus, one aspect of the invention is a database of putative essential or 
otherwise important genes, defined by the absence of transposon insertions in those 
genes in a High Throughput Transposon Insertion Map (HTTIM) database comprising
about 10,000 to about 14,000 transposon insertions in the genome of Pseudomonas
aeruginosa. Minimally, such a database comprises approximately 1800 open reading
frames (ORFs), each of which may be further tested for essentiality using a variety of
tests disclosed herein. However, predictions of essentiality or importance may be
bolstered based on length of the ORF and predicted function and other statistical
factors, thereby providing for more narrow databases of putative essential genes. Thus,
the invention also includes databases that are more narrow and comprise only those
genes for which essentiality or importance may be predicted with at least an 80%
confidence level, and include at least about 850 to about 875 genes. The invention also
includes databases assigned a confidence level of about 85% and including at least
about 675 to about 700 genes. The invention further includes databases assigned a
confidence level of about 90% including at least about 475 to about 500 genes. 
Further, the invention includes databases assigned a confidence level of about 95% and 
including at least about 200 to 250 genes. 
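
The disclosure does not spell out the Bayesian model at this point, so the following Python sketch is only a generic illustration of how ORF length can enter such a confidence estimate. It assumes, purely for the example, that insertions land uniformly at random over a 6,000,000 bp genome, that an essential ORF never tolerates an insertion, and that the prior probability of essentiality is 0.1. The function name posterior_essential and all default values are hypothetical.

```python
def posterior_essential(orf_length_bp, n_insertions,
                        genome_bp=6_000_000, prior=0.1):
    """Posterior probability that an ORF with zero insertions is essential.

    Assumed model (not taken from the source):
      P(no hit | essential)     = 1
      P(no hit | non-essential) = (1 - L/G) ** N   (uniform random insertion)
    and Bayes' rule combines these with the prior.
    """
    p_miss_if_nonessential = (1.0 - orf_length_bp / genome_bp) ** n_insertions
    return prior / (prior + (1.0 - prior) * p_miss_if_nonessential)


# Longer ORFs that escape 14,000 insertions are less likely to be chance misses.
for length in (300, 1000, 3000):
    print(length, round(posterior_essential(length, 14_000), 3))
```

Under these assumptions a long ORF with no insertions is stronger evidence of essentiality than a short one, because a short non-essential ORF is more likely to be missed by chance. Ranking ORFs by such a posterior and applying a cutoff is one way a narrower, higher-confidence database could be obtained.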

[0026] The transposon insertion map and database of putative essential or otherwise
important open reading frames (ORFs) obtained may be used to confirm the essentiality
or importance of genes, for example by integration knock outs in the presence of
chromosomal complementation or by integration and activation of a regulatable
promoter. An "essential" gene is one that cannot be "knocked out," i.e. for which null
mutants having complete absence of the gene product are not viable. This does not
mean, however, that such genes could not tolerate point mutations or truncations that
preserve sufficient gene product function so as to enable cell growth and survival.
Essential genes are to be distinguished from "important" genes, which are also included
in the present invention, in that a "knock out" of an important gene does not lead to cell
death but rather results in an attenuated growth mutant. Such genes may be included in
the database of open reading frames not hit by random transposon mutagenesis as
described herein, because attenuated growth colonies may be significantly smaller than
the average P. aeruginosa colony and may have been overlooked when transposon
insertion mutants were picked to generate the high throughput transposon insertion
database (HTTIM).

[0027] Nevertheless, important gene products may interact with or regulate other 
genes, gene products or cellular processes that are essential, thereby making such gene 
products appropriate targets for drug design. Moreover, most drugs do not effectively
kill all the pathogenic bacteria in the body; rather, they kill or growth attenuate a
portion of the bacteria, empowering the immune system to target the remainder.
Hence, important genes that, when targeted with an antibacterial agent, result in
attenuated growth, are also targets for the antibacterial drugs of the present invention.

[0028] The invention also includes a database of attenuated growth mutants identified 
from the HTTIM transposon database. The genes marked by such mutations are of the 
same class of importance as the "important" genes identified in the no-hit database of
genes, except that the growth attenuated nature of such transposon mutants was 
discovered at the transposon mutagenesis stage, rather than at the stage where 
essentiality is tested via targeted knock out. Thus, genes that when mutated confer 
attenuated growth may be identified from two sources: (1) from the library of open 
reading frames that did not receive a transposon insertion during HTTIM but were 
subsequently identified as an important gene when essentiality was tested via knock out 
and/or promoter swap strategies, and (2) from the HTTIM database itself when in the 
process of accumulating transposon insertion mutants it was observed that a particular
insertion conferred an attenuated growth phenotype.

[0029] Such attenuated mutants grow more slowly than wild type, and may grow 
more slowly due to reduced expression of an essential gene, i.e., transposon is in gene 
that regulates expression of an essential gene, or due to expression of a truncated form 
of an essential gene, i.e., transposon is in the essential gene itself and leads to 
expression of a truncated mRNA. For example, mutants that show a higher drug
susceptibility could be the result of insertions in a gene that potentiates resistance, such
as an efflux pump, or due to reduced expression of essential genes involved in the
mechanism of action of the drug. Expression of mutated forms of essential and 
important genes may make the cell more susceptible to compounds that inhibit that 
particular gene or gene product, and may allow the identification of antibacterial agents 
with greater sensitivity. Furthermore, screening in whole cells overcomes the potential 
problems of uptake and efflux that are sometimes an issue for compounds identified via 
enzyme-based assays. 

[0030] The essential and important genes of the invention may be used to design,
screen for and evaluate potential antibacterial agents for the purpose of developing new
treatments for P. aeruginosa infection. Antibacterial agents identified according to the
invention may have activity against the gene or against the corresponding gene product
or metabolic pathways requiring the gene product. For instance, antibacterial agents
according to the invention may include antisense nucleic acids or regulatory proteins
that bind to open reading frames, to upstream polar sequences or to promoters that
drive expression of the genes encoded by such open reading frames. Active agents
according to the invention may also include antibodies or proteins that bind to proteins
encoded by open reading frames, or to transcriptional or translational regulators of such
genes or proteins, or to binding partners of such proteins. Agents may also be chemical
compounds designed following molecular modeling of essential gene products
according to the invention, or mutant proteins designed therefrom that compete with the
essential wild type protein for reactive cell components or for interacting nutrients, as
well as agents from random chemical libraries.

[0031] The present invention therefore includes methods and assays for identifying
antibacterial agents having specificity for the essential or important open reading
frames identified, or to genes and proteins that interact with such open reading frames
or the products encoded thereby. Once essential and important open reading frames are
identified, antibacterial agents may be identified using the assays and methods
described herein, or by any suitable assay. Such assays may vary depending on the
function delineated for each essential locus, as would be apparent to those of skill in the
art. For instance, enzyme assays may be designed based on the predicted function of
essential and important genes in order to define classes of inhibitors to be tested. Also,
random chemical libraries may be screened for activity against the isolated genes or
gene products. Cell lines may be designed or isolated that demonstrate reduced
expression of essential genes, thereby providing a sensitive screening tool for inhibitors
that affect the activity of that gene or gene product as it functions in the cell. Such cell
lines may be devised from cells having transposon insertions that lead to attenuated
growth, or may be constructed by the promoter swap techniques described herein, by
using a regulatable promoter that can be used to increase gene expression, allowing for
confirmation of target specificity. Here, the minimum inhibitory concentration (MIC) of
the inhibitor is directly related to the expression level of the target gene, such that under
low expression, an attenuated growth cell is more susceptible to an inhibitor than the
wild type strain, and as the expression level is raised, the MIC increases. The MIC shift
will be consistent when the inhibitor acts on the regulated target.
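
The expected MIC behaviour can be expressed as a simple consistency check. The Python sketch below is illustrative only: the function name mic_shift_is_consistent, the input format and the example values are assumptions, and it further assumes that a higher inducer concentration corresponds to higher expression of the target gene.

```python
def mic_shift_is_consistent(mic_by_inducer, wild_type_mic):
    """Check whether MIC data fit an inhibitor acting on the regulated target.

    mic_by_inducer: non-empty list of (inducer_concentration, mic) pairs for
    the promoter-swap strain (hypothetical measurements).
    wild_type_mic: MIC of the same inhibitor against the wild type strain.
    """
    # Order the MIC values by increasing inducer level (assumed to track expression).
    ordered = [mic for _, mic in sorted(mic_by_inducer)]
    # MIC should never fall as expression of the target rises.
    rises_with_expression = all(a <= b for a, b in zip(ordered, ordered[1:]))
    # At the lowest expression the strain should be more susceptible than wild type.
    hypersensitive_when_low = ordered[0] < wild_type_mic
    # There must be an actual shift between lowest and highest expression.
    shifted = ordered[0] < ordered[-1]
    return rises_with_expression and hypersensitive_when_low and shifted


# Hypothetical data: MIC climbs with inducer, as expected for an on-target inhibitor.
on_target = [(0.0, 0.5), (0.01, 2.0), (0.1, 8.0), (1.0, 16.0)]
# Hypothetical data: MIC is flat, suggesting the inhibitor acts elsewhere.
off_target = [(0.0, 8.0), (0.1, 8.0), (1.0, 8.0)]

print(mic_shift_is_consistent(on_target, wild_type_mic=8.0))
print(mic_shift_is_consistent(off_target, wild_type_mic=8.0))
```

A flat MIC profile across expression levels is the pattern one would expect when the inhibitor's target is not the gene under the regulatable promoter.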

[0032] Active agents and compounds can be formulated into pharmaceutical
compounds and compositions, effective for treating and preventing Pseudomonas
infections in accordance with the methods of the invention. Such therapy will be
particularly useful in the hospital setting for preventing and treating nosocomial
infections, and for administering to cystic fibrosis patients to improve lung function and
quality of life. Depending on the activity of the essential or important gene targeted,
such agents could also be useful in treating all types of Pseudomonas infections ranging
from bacteraemia and septicemia, urinary-tract infections, pneumonia and chronic lung
infections, burn infections, infections associated with cancer and AIDS, endocarditis,
dermatitis, osteochondritis, ear and eye infections, bone and joint infections,
gastrointestinal infections and skin and soft tissue infections, including wound
infections, pyoderma and dermatitis. Further,
the invention provides pharmaceutical compositions appropriate for use in methods of
treating bacterial infections described above.

Brief Description of the Drawings

[0033] Figure 1. Depiction of a single crossover recombination event resulting in
integration of a plasmid into the bacterial chromosome. Isolation of such recombinants 
indicates that the targeted gene is not essential. 

[0034] Figure 2. Single crossover and integration of a plasmid resulting in the 
replacement of a wild type promoter with a regulatable promoter. 

[0035] Figure 3. Depiction of the 'promoter swap' strategy, using transformation of
pBEM10 into P. aeruginosa in order to replace the lpxC promoter with the arabinose
araBAD promoter, thereby allowing modulation of lpxC expression by the use of a
simple sugar, arabinose.

[0036] Figure 4. Graph showing the susceptibility or non-susceptibility of various E.
coli and P. aeruginosa strains to the inhibitor L161,240.

[0037] Figure 5. Graph depicting the effect of tetracycline and L161,240 on the
growth of P. aeruginosa strain PAO1 with and without polymyxin permeabilization.

[0038] Figure 6. Sensitivity of various E. coli and P. aeruginosa strains to inhibitor
L161,240 following promoter swap and transformation with vector expressing E. coli
lpxC or P. aeruginosa lpxC. "E. coli swaps" refer to P. aeruginosa containing a vector
comprising E. coli lpxC, and "PA swaps" refer to P. aeruginosa containing a vector
comprising P. aeruginosa lpxC.

[0039] Figure 7. Graph illustrating ORF coverage by Tn5 achieved in High- 
Throughput Transposon Insertion Mapping (HTTIM), wherein 30% of the genes in the 
genome are candidate essential genes where ORF size is not taken into account in 
predicting essentiality. 

[0040] Figure 8. Graph depicting the probability of identifying an essential gene 
given no transposon insertion, as a function of gene size. 

[0041] Figure 9. A circular map of the P. aeruginosa genome showing distribution of
transposon insertion sites constituting a HTTIM of the invention, and demonstrating the
random nature of the transposon employed. The length of the bars radiating outward
from the center of the circular map reflects the number of transposon insertions per
non-overlapping kilobase.

[0042] Figure 10. Histogram depicting the number of ORFs in the P. aeruginosa 
genome of (a) up to 4000 base pairs and (b) from 4000 up to 16884 base pairs. 

[0043] Figure 11. Graph showing likelihood and accumulative likelihood gains.

[0044] Figure 12. Trajectory of the algorithm projected in a subspace spanned by two
gene sizes. The x-axis represents genes of sizes 151-160 DNA base-pairs and the y-axis
represents genes of sizes 171-180 DNA base-pairs. Here n=(2,1) and M=(7,9). The
median gene size of each group is used as the gene size. At iteration number 66, the
likelihood gain is maximum in the direction of increasing the number of nonessential
genes by one for genes with size 171-180 DNA base-pairs. At iteration number 443, the
largest likelihood gain is obtained in the direction of increasing the number of
nonessential genes by one for genes of sizes 151-160 DNA base-pairs. At any point,
moving backwards has a negative likelihood gain.

[0045] Figure 13. More trajectories of the searching algorithm projected in different 
subspaces. 

[0046] Figure 14. Plot of likelihood for different initial values. 

[0047] Figure 15. Trajectories of the algorithm with different starting values
projected in the subspace spanned by two gene sizes: 1101-1150 DNA base-pairs for
the x-axis and 921-930 DNA base-pairs for the y-axis.

[0048] Figure 16. Top: (A) The top line is M, the number of genes; the bottom line is fi,
the number of genes with at least one observed insertion; the line in the middle is N, the
number of estimated nonessential genes. For demonstration purposes, a cubic spline
smooth is applied to the data. Bottom: Histograms of resamples of f (B) and X (C).

[0049] Figure 17. Plot of nJm^. The dotted line is the value of nJm^ and the solid line
is a moving average smooth.

[0050] The essential and important open reading frames identified in the present
invention were originally part of a library of putative nucleic acid sequences generated
from P. aeruginosa strains PAO1 and PAK. See Table 1. Nevertheless, it is expected
that the genes identified will also be essential or important in related P. aeruginosa
strains as well as other Pseudomonas species, given the low sequence diversity that
exists between P. aeruginosa strains of widely diverse environments and the
pronounced structural and functional homology of gene products. See, e.g.,
Spangenberg et al., 1998, Structural and functional implications of sequence diversity
of Pseudomonas aeruginosa genes oriT, ampC and fliC, Electrophoresis 19(4): 545-50;
Ruimy et al., 2001, Genetic diversity of Pseudomonas aeruginosa strains isolated from
ventilated patients with nosocomial pneumonia, cancer patients, bacteremia, and
environmental water, Infect. Immun. 69(1): 584-8. For instance, comparative
sequencing of several P. aeruginosa genes from several environmental and clinical
isolates revealed the sequence diversity to be about one order of magnitude lower than
in comparable housekeeping genes from Salmonella. See Kiewitz and Tummler, 2000,
Sequence diversity of Pseudomonas aeruginosa: impact on population structure and
genome evolution, J. Bacteriol. 182(11): 3125-35. Thus, it is expected that agents
identified as antibacterial based on their interaction with genes or gene products of P.
aeruginosa PAO1 or PAK will be broadly applicable as antibacterial agents against a
variety of Pseudomonas species as well as other bacteria including but not limited to
Escherichia, Hemophilus, Vibrio, Borrelia, Enterococcus, Helicobacter, Legionella,
Mycobacterium, Mycoplasma, Neisseria, Staphylococcus, Streptococcus, etc.

[0051] Thus, the present invention encompasses an isolated nucleic acid molecule 
comprising a nucleic acid sequence encoding a polypeptide having at least 80% 
sequence identity to a polypeptide encoded by a nucleic acid sequence selected from 
the group consisting of the Pseudomonas aeruginosa open reading frames (ORFs) 
listed in Table 1. More preferably, the present invention encompasses an isolated 
nucleic acid molecule comprising a nucleic acid sequence encoding a polypeptide 
having at least about 85 to 90% sequence identity to a polypeptide encoded by a nucleic 
acid sequence selected from the group consisting of the Pseudomonas aeruginosa open
reading frames (ORFs) listed in Table 1. Even more preferably, the present invention
encompasses an isolated nucleic acid molecule comprising a nucleic acid sequence 
encoding a polypeptide having at least about 90 to about 95% sequence identity to a 
polypeptide encoded by a nucleic acid sequence selected from the group consisting of 
the Pseudomonas aeruginosa open reading frames (ORFs) listed in Table 1. 

[0052] In particular, the invention encompasses isolated nucleic acid molecules
comprising nucleic acid sequences encoding polypeptides having at least 80% sequence
identity, or more preferably at least about 85 to 90 to 95% identity, to a polypeptide
encoded by an essential or important nucleic acid sequence selected from the group
consisting of the Pseudomonas aeruginosa open reading frames (ORFs) listed in Table
1, wherein essentiality or importance of said nucleic acid sequence is determined by
integration knock-out coupled with extra-chromosomal complementation. Likewise,
the invention encompasses isolated nucleic acid molecules comprising nucleic acid
sequences encoding polypeptides having at least 80% sequence identity, or more
preferably at least about 85 to 90 to 95% identity, to a polypeptide encoded by an
essential nucleic acid sequence selected from the group consisting of the Pseudomonas
aeruginosa open reading frames (ORFs) listed in Table 1, wherein essentiality or
importance of said nucleic acid sequence is determined by integration of a regulatable
promoter into the gene, or via any other suitable method.
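
The sequence identity thresholds recited above can be illustrated with a short sketch. The text does not state how percent identity is to be computed, so the convention used here (identical residues divided by aligned, ungapped columns of a pre-computed pairwise alignment) is an assumption, as are the function names and the example peptides.

```python
from typing import Optional


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Both inputs must already be aligned (equal length, '-' marks a gap).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # gapped columns are skipped under this convention
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def identity_tier(pct: float) -> Optional[int]:
    """Return the highest recited threshold (95, 90, 85 or 80) met, else None."""
    for threshold in (95, 90, 85, 80):
        if pct >= threshold:
            return threshold
    return None


# Hypothetical ten-residue peptides differing at one position.
pct = percent_identity("MKTAYIAKQR", "MKTAHIAKQR")
tier = identity_tier(pct)
```

Other conventions (for example, dividing by the length of the shorter sequence) give different values for gapped alignments, so the convention should be fixed before thresholds are compared.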

[0053] In one embodiment, the polynucleotides of the invention are recombinant.
Recombinant polynucleotides of the invention include polynucleotides of genomic, cDNA,
semisynthetic, or synthetic origin, each of which, by virtue of its origin or manipulation
(1) is not associated with all or a portion of a polynucleotide with which it is associated in
nature; (2) is linked to a polynucleotide other than that to which it is linked in nature; or
(3) does not occur in nature.

[0054] Given that the library of nucleic acid sequences encompassed in Table 1
provides an unprecedented tool useful for the identification of essential and otherwise
important genes in Pseudomonas and the construction and isolation of attenuated
mutants, the present invention includes a library of nucleic acid sequences consisting
essentially of nucleic acid sequences having at least 70% sequence identity, or more
preferably at least about 80 to 90 to 95% identity, to a nucleic acid sequence selected
from the group consisting of the Pseudomonas aeruginosa open reading frames (ORFs)
listed in Table 1, wherein said library of nucleic acid sequences is employed to identify
essential or otherwise important genes or to construct or isolate attenuated mutants in
Pseudomonas.

[0055] Also encompassed in the invention is a map of at least about 10,000 to about
14,000 transposon insertions in the genome of Pseudomonas aeruginosa (High-
Throughput Transposon Insertion Mapping or HTTIM), wherein said map is useful for
identifying genes that are essential or important for survival of said Pseudomonas
aeruginosa, i.e., by permitting the generation of a database of open reading frames that
do not contain a transposon insertion. Figure 9 contains a circular map of the P.
aeruginosa genome depicting 12,000 to 13,000 transposon insertion sites constituting a
HTTIM of the invention, and demonstrates the random nature of the transposon
employed. The length of the bars radiating outward from the center of the circular map
reflects the number of transposon insertions per non-overlapping kilobase. Table 3
contains a list of 13,515 specific Tn5 transposon insertion sites generated in either PAK
or PAO1, with the 473 mutants 12516-13043 being identified as attenuated for growth.
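
As an illustration of how a database of open reading frames lacking a transposon insertion might be derived from a list of insertion sites, a minimal sketch follows. The ORF names, coordinates and the function `orfs_without_insertions` are hypothetical, and the sketch assumes simple linear coordinates; an ORF spanning the origin of a circular chromosome would need separate handling.

```python
from bisect import bisect_left
from typing import Dict, List, Tuple


def orfs_without_insertions(orfs: Dict[str, Tuple[int, int]],
                            insertion_sites: List[int]) -> List[str]:
    """Return names of ORFs whose coordinate range contains no insertion site.

    orfs maps an ORF name to (start, end) genome coordinates; either strand
    orientation is accepted because the pair is normalised to (low, high).
    """
    sites = sorted(insertion_sites)
    untouched = []
    for name, (start, end) in orfs.items():
        low, high = min(start, end), max(start, end)
        # Index of the first insertion site at or beyond the ORF's low coordinate.
        idx = bisect_left(sites, low)
        if idx == len(sites) or sites[idx] > high:
            untouched.append(name)
    return untouched


# Hypothetical coordinates and insertion sites (illustration only).
example_orfs = {"orfA": (100, 400), "orfB": (500, 900), "orfC": (1500, 1200)}
example_sites = [150, 950, 1600]
candidates = orfs_without_insertions(example_orfs, example_sites)
```

In this example `orfA` carries an insertion at position 150, so only `orfB` and `orfC` remain as candidates. Absence of an insertion is only suggestive of essentiality, particularly for short ORFs.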

[0056] Thus, the databases and libraries disclosed herein may be used to formulate
useful subsets of these libraries and databases. Accordingly, the invention includes
subsets of the databases and libraries disclosed. For instance, mutants 12516-13043 are
identified as attenuated for growth and as such, the genes in this subset could be useful
drug targets. Accordingly, this group of 473 mutants from the HTTIM database of
13,515 transposon hits provides a useful subset database for comparing homologies
with essential genes of other organisms, for computer modeling of potential
antibacterial agents, etc. A particularly useful database subset is one containing
essential genes from P. aeruginosa that are also identified as essential in other Gram
negative or Gram positive bacteria. Indeed, genes that have essential homologs in other
organisms are likely to provide useful targets for broad spectrum antibacterial agents, i.e.,
agents that have broad spectrum activity as an antibacterial agent. Genes in the
putative essential or important gene database have already been identified via BLAST
or other database analyses, and constitute an exemplary subset database of the present
invention. See Table 4.

[0057] Further, the databases and subset databases of the present invention may also
be used as comparative tools with other like databases or database subsets to identify
broad spectrum antibacterial targets. For instance, particularly envisioned is an
embodiment wherein the database of putative essential and important genes identified
in P. aeruginosa is cross-referenced with a similar database formed from S. aureus,
wherein homologues present in both databases signal a potential target for a broad
spectrum antibacterial agent. Cross-referencing between P. aeruginosa and S. aureus
in particular will identify antibacterial targets for identifying broad spectrum antibiotics
active against both Gram negative and Gram positive bacteria. However, databases
derived from any bacteria could be employed in such comparisons, as well as databases
formed from yeast, fungi, mycoplasma, and other potential pathogens.
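
The cross-referencing of two essential-gene databases can be sketched as a lookup through a homolog table. The gene identifiers, the homolog mapping and the function `shared_essential_targets` are hypothetical placeholders; in practice the mapping would come from a homology search such as BLAST, which is not reproduced here.

```python
from typing import Dict, List, Set, Tuple


def shared_essential_targets(essential_a: Set[str],
                             essential_b: Set[str],
                             homologs_a_to_b: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (gene_a, gene_b) pairs where both homologs are listed as essential.

    homologs_a_to_b maps a gene in organism A to its homolog in organism B.
    Genes without a mapped homolog are ignored.
    """
    pairs = []
    for gene_a in essential_a:
        gene_b = homologs_a_to_b.get(gene_a)
        if gene_b is not None and gene_b in essential_b:
            pairs.append((gene_a, gene_b))
    return sorted(pairs)


# Hypothetical identifiers (illustration only).
pa_essential = {"PA_x1", "PA_x2", "PA_x3"}
sa_essential = {"SA_y1", "SA_y9"}
homolog_table = {"PA_x1": "SA_y1", "PA_x2": "SA_y2"}
broad_spectrum_candidates = shared_essential_targets(
    pa_essential, sa_essential, homolog_table)
```

Only `PA_x1` survives in this example: `PA_x2` has a homolog that is not listed as essential, and `PA_x3` has no mapped homolog.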

[0058] Also encompassed in the invention is the use of essential and important genes
and the corresponding proteins expressed therefrom in the design of vaccines for eliciting
prophylactic or therapeutic immune responses against Pseudomonas aeruginosa.

[0059] Such vaccines will typically comprise a Pseudomonas aeruginosa protein
antigen or fragment or variant thereof encoded by an essential gene. Additionally, such
antigens will preferably be a protein expressed on the surface of the bacteria.

[0060] Such vaccines will typically comprise a Pseudomonas aeruginosa protein
antigen or fragment or derivative thereof encoded by an essential or important gene.
Preferably, the protein antigen is expressed from a recombinant polynucleotide.

[0061] Where the invention is directed to a fragment of a protein encoded by an
essential or important gene, said fragment is preferably at least 8 to 12 amino acids
long, and even more preferably at least about 20 to 30 amino acids long. Preferably,
the fragment comprises either a B cell or a T cell epitope.

[0062] Where the invention is directed to a derivative of a protein encoded by an
essential or important gene, said derivative contains one or more amino acid
substitutions, additions or deletions. Preferably, the amino acid substitutions are
conservative amino acid replacements. Conservative amino acid replacements are
those that take place within a family of amino acids that are related in their side chains.
Genetically encoded amino acids are generally divided into four families: (1) acidic =
aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) non-polar = alanine,
valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4)
uncharged polar = glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine.
Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic
amino acids. For example, it is reasonably predictable that an isolated replacement of a
leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a
serine, or a similar conservative replacement of an amino acid with a structurally
related amino acid will not have a major effect on the biological activity. Polypeptide
molecules having substantially the same amino acid sequence as the protein but
possessing minor amino acid substitutions that do not substantially affect the functional
aspects are encompassed within the scope of derivatives of the proteins of the invention.
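
The four-family grouping described above lends itself to a simple lookup. The following sketch encodes the families with standard one-letter amino acid codes and tests whether a single substitution stays within one family; the names `FAMILIES` and `is_conservative` are illustrative assumptions rather than part of the disclosure.

```python
# Four side-chain families as described in the text, in one-letter codes.
FAMILIES = {
    "acidic": "DE",                # aspartate, glutamate
    "basic": "KRH",                # lysine, arginine, histidine
    "non-polar": "AVLIPFMW",       # Ala, Val, Leu, Ile, Pro, Phe, Met, Trp
    "uncharged polar": "GNQCSTY",  # Gly, Asn, Gln, Cys, Ser, Thr, Tyr
}

# Reverse lookup: residue -> family name.
FAMILY_OF = {aa: family for family, members in FAMILIES.items() for aa in members}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if the substitution changes the residue but stays in one family."""
    a, b = original.upper(), replacement.upper()
    if a not in FAMILY_OF or b not in FAMILY_OF:
        raise ValueError("expected one-letter codes for the 20 encoded amino acids")
    return a != b and FAMILY_OF[a] == FAMILY_OF[b]


# Examples taken from the replacements named in the text.
leu_to_ile = is_conservative("L", "I")  # leucine -> isoleucine
asp_to_glu = is_conservative("D", "E")  # aspartate -> glutamate
asp_to_lys = is_conservative("D", "K")  # acidic -> basic, not conservative
```

This family test is deliberately coarse; substitution matrices would give a graded rather than a yes/no answer.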

[0063] The polypeptide fragment or derivative is preferably immunologically
identifiable with the polypeptide encoded by the essential or important gene. The
polypeptide fragment or derivative is preferably immunogenic and is able to cause a
humoral and/or cellular immune response, either alone or when linked to a carrier, in
the presence or absence of an adjuvant. The polypeptide fragment or derivative may be
fused to or incorporated into another polypeptide sequence. This other polypeptide
sequence may include one or more other proteins, fragments or derivatives thereof
encoded by an essential or important gene. The other polypeptide sequence may also
include a polypeptide sequence which allows for presentation of the polypeptide
fragment or derivative.

[0064] Accordingly, the present invention encompasses an isolated polypeptide and
fragments and derivatives thereof, wherein said polypeptide has at least 80% sequence
identity to a polypeptide encoded by a nucleic acid sequence selected from the group
consisting of the Pseudomonas aeruginosa open reading frames (ORFs) listed in Table
1. More preferably, the present invention encompasses an isolated polypeptide and
fragments and derivatives thereof, wherein said polypeptide has at least about 85 to
90% sequence identity to a polypeptide encoded by a nucleic acid sequence selected
from the group consisting of the Pseudomonas aeruginosa open reading frames (ORFs)
listed in Table 1. Even more preferably, the present invention encompasses an isolated
polypeptide and fragments and derivatives thereof, wherein said polypeptide has at
least about 90% to about 95% sequence identity to a polypeptide encoded by a nucleic
acid sequence selected from the group consisting of the Pseudomonas aeruginosa open
reading frames (ORFs) listed in Table 1.

[0065] In particular, the invention encompasses isolated polypeptides and fragments
and derivatives thereof, wherein said polypeptides have at least 80% sequence identity,
or more preferably at least about 85 to 90 to 95% identity, to a polypeptide encoded by
an essential or important nucleic acid sequence selected from the group consisting of
the Pseudomonas aeruginosa open reading frames (ORFs) listed in Table 1, wherein
the essentiality or importance of said nucleic acid sequence is determined by integration
knock-out coupled with extra-chromosomal complementation. Likewise, the invention
encompasses isolated polypeptides and fragments and derivatives thereof, wherein said
polypeptides have at least 80% sequence identity, or more preferably at least about 85
to 90 to 95% identity, to a polypeptide encoded by an essential nucleic acid sequence
selected from the group consisting of the Pseudomonas aeruginosa open reading
frames (ORFs) listed in Table 1, wherein essentiality or importance of said nucleic acid
sequence is determined by integration of a regulatable promoter into the gene, or via
any other suitable method.

[0066] Also encompassed in the invention are therapeutic and prophylactic vaccines
that comprise ligands that specifically bind antigens encoded by essential or important
genes identified according to the invention, for use in, for instance, passive
immunization. Preferred ligands are antibodies and antibody fragments that
specifically bind the antigen encoded by the essential gene. Such antibodies may be
polyclonal or monoclonal. Types of antibodies and antibody fragments include by way
of example murine antibodies, chimeric antibodies, humanized antibodies, Fab
fragments, Fab2 fragments and human antibodies and scFv's. Methods for producing
antibodies and antibody fragments by recombinant and non-recombinant methods are
well known to those skilled in the art. In some embodiments the antigen used in such
passive immunization may be attached to a cytotoxic moiety, e.g., a radionuclide or
other agent that is cytotoxic against the bacteria.

[0067] Further encompassed within the scope of the invention are cells or viral 
vectors that express on their surface a Pseudomonas aeruginosa essential gene, 
fragment or variant identified according to the invention. 

[0068] In the case of prophylactic vaccines, the vaccine will comprise an
immunogenic composition comprising a prophylactically effective amount of an
antigen, antibody, cells or vector expressing an antigen encoded by an essential or
important gene and will be formulated such that upon administration it elicits a
protective immune response. In the case of therapeutic vaccines, the vaccine will
comprise an immunogenic composition comprising a therapeutically effective amount
of an antigen, antibody, cells or vectors expressing an antigen encoded by an essential
or important gene and will be formulated such that upon administration it elicits a
therapeutic immune response. Dosage effective amounts of prophylactic and
therapeutic vaccines will be determined by known methods and will typically vary from
about 0.00001 g/kg body weight to about 5-10 g/kg body weight.

[0069] The immunogenic compositions of the invention can be administered by 
known methods, i.e., mucosally or parenterally. 

[0070] Suitable routes of mucosal administration include oral, intranasal (IN),
intragastric, pulmonary, intestinal, rectal, ocular, and vaginal routes. Preferably,
mucosal administration is oral or intranasal.

[0071] Where mucosal administration is used, the immunogenic composition is
preferably adapted for mucosal administration. For instance, where the composition is
administered orally, it may be in the form of tablets or capsules (optionally
enteric-coated), liquid, transgenic plants, etc. Where the composition is administered
intranasally, it may be in the form of a nasal spray, nasal drops, gel or powder. Where
the antigen composition is adapted for mucosal administration, it may further be
formulated such that the antigen remains stable, for instance by the use of carriers and
excipients.

[0072] The immunogenic compositions of the invention can further comprise a
mucosal adjuvant. Mucosal adjuvants suitable for use in the invention include (A)
E. coli heat-labile enterotoxin ("LT"), or detoxified mutants thereof, such as the K63 or
R72 mutants; (B) cholera toxin ("CT"), or detoxified mutants thereof; (C)
microparticles (i.e., a particle of ~100 nm to ~150 µm in diameter, more preferably
~200 nm to ~30 µm in diameter, and most preferably ~500 nm to ~10 µm in diameter)
formed from materials that are biodegradable and non-toxic (e.g. a poly(α-hydroxy
acid), a polyhydroxybutyric acid, a polyorthoester, a polyanhydride, a polycaprolactone
etc.); (D) a polyoxyethylene ether or a polyoxyethylene ester (see International patent
application WO 99/52549); (E) a polyoxyethylene sorbitan ester surfactant in
combination with an octoxynol (see International patent application WO 01/21207) or a
polyoxyethylene alkyl ether or ester surfactant in combination with at least one
additional non-ionic surfactant such as an octoxynol (see International patent
application WO 01/21152); (F) chitosan (e.g. International patent application WO
99/27960); and (G) an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide)
and a saponin (see International patent application WO 00/62800). Other mucosal
adjuvants are also available (e.g. see chapter 7 of Vaccine design: the subunit and
adjuvant approach, eds. Powell & Newman, Plenum Press 1995 (ISBN 0-306-44867-X)).

[0073] Mutants of LT are preferred mucosal adjuvants, in particular the "K63" and
"R72" mutants (e.g. see International patent application WO 98/18928), as these result
in an enhanced immune response.

[0073] Microparticles are also preferred mucosal adjuvants. These are preferably
derived from a poly(α-hydroxy acid), in particular, from a poly(lactide) ("PLA"), a
copolymer of D,L-lactide and glycolide or glycolic acid, such as a
poly(D,L-lactide-co-glycolide) ("PLG" or "PLGA"), or a copolymer of D,L-lactide and
caprolactone. The microparticles may be derived from any of various polymeric starting
materials which have a variety of molecular weights and, in the case of the copolymers
such as PLG, a variety of lactide:glycolide ratios, the selection of which will be largely a
matter of choice, depending in part on the coadministered antigen.

[0074] Antigen may be entrapped within the microparticles, or may be adsorbed to
them.

[0075] Entrapment within PLG microparticles is preferred. PLG microparticles are
discussed in further detail in Morris et al., (1994), Vaccine, 12:5-11, in chapter 13 of
Mucosal Vaccines, eds. Kiyono et al., Academic Press 1996 (ISBN 012410587), and in
chapters 16 & 18 of Vaccine design: the subunit and adjuvant approach, eds. Powell &
Newman, Plenum Press 1995 (ISBN 0-306-44867-X).

[0076] LT mutants may advantageously be used in combination with microparticle-
entrapped antigen, resulting in significantly enhanced immune responses.

[0077] Suitable routes of parenteral administration include intramuscular (IM),
subcutaneous, intravenous, intraperitoneal, intradermal, transcutaneous, and
transdermal (see e.g., International patent application WO 98/20734) routes, as well as
delivery to the interstitial space of a tissue.

[0078] The immunogenic compositions of the invention may be adapted for 
parenteral administration (e.g., in the form of an injectable, which will typically be 
sterile and pyrogen-free). 

[0079] The immunogenic composition may further comprise a parenteral adjuvant.
Parenteral adjuvants suitable for use in the invention include: (A) aluminum
compounds (e.g. aluminum hydroxide, aluminum phosphate, aluminum
hydroxyphosphate, oxyhydroxide, orthophosphate, sulfate etc. (e.g. see chapters 8 & 9
of Vaccine design: the subunit and adjuvant approach, eds. Powell & Newman, Plenum
Press 1995 (ISBN 0-306-44867-X), hereinafter "Vaccine design")), or mixtures of
different aluminum compounds, with the compounds taking any suitable form (e.g. gel,
crystalline, amorphous etc.), and with adsorption being preferred; (B) MF59 (5%
Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated into submicron particles
using a microfluidizer) (see Chapter 10 of Vaccine design; see also International patent
application WO 90/14837); (C) liposomes (see Chapters 13 and 14 of Vaccine design);
(D) ISCOMs (see Chapter 23 of Vaccine design); (E) SAF, containing 10% Squalane,
0.4% Tween 80, 5% pluronic-block polymer L121, and thr-MDP, either microfluidized
into a submicron emulsion or vortexed to generate a larger particle size emulsion (see
Chapter 12 of Vaccine design); (F) Ribi™ adjuvant system (RAS), (Ribi
Immunochem) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell
wall components from the group consisting of monophosphoryl lipid A (MPL), trehalose
dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™);
(G) saponin adjuvants, such as QuilA or QS21 (see Chapter 22 of Vaccine design), also
known as Stimulon™; (H) ISCOMs, which may be devoid of additional detergent
(International patent application WO 00/07621); (I) complete Freund's adjuvant (CFA)
and incomplete Freund's adjuvant (IFA); (J) cytokines, such as interleukins (e.g. IL-1,
IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), interferons (e.g. interferon-γ), macrophage
colony stimulating factor, tumor necrosis factor, etc. (see Chapters 27 & 28 of Vaccine
design); (K) microparticles (see above); (L) monophosphoryl lipid A (MPL) or
3-O-deacylated MPL (3dMPL) (e.g. chapter 21 of Vaccine design); (M) combinations of
3dMPL with, for example, QS21 and/or oil-in-water emulsions (European patent
applications 0835318, 0735898 and 0761231); (N) oligonucleotides comprising CpG
motifs (see Krieg (2000) Vaccine, 19:618-622; Krieg (2001) Curr. Opin. Mol. Ther.,
2001, 3:15-24; WO 96/02555, WO 98/16247, WO 98/18810, WO 98/40100, WO
98/55495, WO 98/37919 and WO 98/52581, etc.) i.e. containing at least one CG
dinucleotide, with 5-methylcytosine optionally being used in place of cytosine; (O) a
polyoxyethylene ether or a polyoxyethylene ester (International patent application WO
99/52549); (P) a polyoxyethylene sorbitan ester surfactant in combination with an
octoxynol (International patent application WO 01/21207) or a polyoxyethylene alkyl
ether or ester surfactant in combination with at least one additional non-ionic surfactant
such as an octoxynol (International patent application WO 01/21152); (Q) an
immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) and a saponin
(International patent application WO 00/62800); (R) an immunostimulant and a particle
of metal salt (International patent application WO 00/23105); (S) a saponin and an oil-
in-water emulsion (International patent application WO 99/11241); (T) a saponin (e.g.
QS21) + 3dMPL + IL-12 (optionally + a sterol) (International patent application WO
98/57659); and (U) other substances that act as immunostimulating agents to enhance
the effectiveness of the composition (e.g. see Chapter 7 of Vaccine design).



[0080] Aluminum compounds and MF59 are preferred adjuvants for parenteral use.

[0081] The immunogenic compositions of the invention may be administered in a
single dose, or as part of an administration regime. The regime may include priming
and boosting doses, which may be administered mucosally, parenterally, or various
combinations thereof.

[0082] In some instances the vaccines of the invention may comprise several
antigens, fragments or variants encoded by essential genes identified according to the 
invention. Alternatively, the vaccine may further comprise antigens identified by other 
methods, or specific to other bacteria, e.g., in order to provide multivalent vaccines. 

[0083] With respect to libraries according to the invention, a library of 
polynucleotides or a library of transposon insertion sites is a collection of sequence 
information, which information is provided in either biochemical form (e.g., as a
collection of polynucleotide molecules), or in electronic form (e.g., as a collection of 
polynucleotide sequences stored in a computer-readable form, as in a computer system 
and/or as part of a computer program). The sequence information of the 
polynucleotides can be used in a variety of ways, for instance as a resource for gene 
discovery, i.e., for identifying and verifying essential and important genes in P. 
aeruginosa, or for identifying essential or important homologues in other genera or 
species. A polynucleotide sequence in a library can be a polynucleotide that represents 
an mRNA, polypeptide, or other gene product encoded by the polynucleotide, and 
accordingly such a polynucleotide library could be used to formulate corresponding 
RNA or amino acid libraries according to the sequences of the library members. 

[0084] The nucleotide sequence information of the library can be embodied in any
suitable form, e.g., electronic or biochemical forms. For example, a library of sequence
information embodied in electronic form comprises an accessible computer data file
(or, in biochemical form, a collection of nucleic acid molecules) that contains the
representative nucleotide sequences of essential and important genes and/or insertion
mutants that are differentially expressed (e.g., attenuated growth mutants). Other
combinations and comparisons of cells affected by various diseases or stages of disease
will be readily apparent to the ordinarily skilled artisan. Biochemical embodiments of
the library include a collection of nucleic acids that have the sequences of the genes or
transposon insertion sites in the library, where the nucleic acids can correspond to the
entire gene in the library or to a fragment thereof, as described in greater detail below.

[0085] The polynucleotide libraries of the subject invention generally comprise 
sequence information of a plurality of polynucleotide sequences, where at least one of 
the polynucleotides has a sequence of any of the sequences in Tables 1-3. By plurality 
is meant at least 2, usually at least 3 and can include up to all of the sequences included 
in these tables. The length and number of polynucleotides in the library will vary with 
the nature of the library, e.g., if the library is an oligonucleotide array, a cDNA array, a 
computer database of the sequence information, etc. 

[0086] Where the library is an electronic library, the nucleic acid sequence 
information can be present in a variety of media. "Media" refers to a manufacture, 
other than an isolated nucleic acid molecule, that contains the sequence information of 
the present invention. Such a manufacture provides the genome sequence or a subset 
thereof in a form that can be examined by means not directly applicable to the sequence 
as it exists in a nucleic acid. For example, the nucleotide sequence of the present 
invention, e.g., the nucleic acid sequences of any of the polynucleotides of Tables 1-3, 
can be recorded on computer readable media, e.g., any medium that can be read and 
accessed directly by a computer. Such media include, but are not limited to: magnetic 
storage media, such as a floppy disc, a hard disc storage medium, and a magnetic tape; 
optical storage media such as CD-ROM; electrical storage media such as RAM and 
ROM; and hybrids of these categories such as magnetic/optical storage media. One of 
skill in the art can readily appreciate how any of the presently known computer 
readable mediums can be used to create a manufacture comprising a recording of the 
present sequence information. "Recorded" refers to a process for storing information 
on computer readable medium, using any such methods as known in the art. Any 
convenient data storage structure can be chosen, based on the means used to access the 
stored information. A variety of data processor programs and formats can be used for 
storage, e.g., word processing text file, database format, etc. In addition to the sequence 
information, electronic versions of the libraries of the invention can be provided in 
conjunction or connection with other computer-readable information and/or other types 
of computer-readable files (e.g., searchable files, executable files, etc., including, but 
not limited to, for example, search program software, etc.). 
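
By way of illustration only, the recording of sequence information in a computer-readable text file can be sketched as follows. This is a minimal sketch that assumes a FASTA-style layout; the identifiers, the sequences, the file name and the function names are hypothetical placeholders and are not sequences of Tables 1-3.

```python
import os
import tempfile


def write_library(path, records):
    """Record (identifier, sequence) pairs in a FASTA-style text file."""
    with open(path, "w") as handle:
        for identifier, sequence in records:
            handle.write(">" + identifier + "\n")
            # Wrap sequence lines at 60 characters, a common convention.
            for start in range(0, len(sequence), 60):
                handle.write(sequence[start:start + 60] + "\n")


def read_library(path):
    """Read the file back into a dict mapping identifier -> sequence."""
    library = {}
    current = None
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                current = line[1:]
                library[current] = ""
            elif current is not None:
                library[current] += line
    return library


# Hypothetical placeholder entries, not sequences from Tables 1-3.
EXAMPLE_RECORDS = [
    ("gene_A", "ATGGCTAGCTAGGCTAACGT" * 4),
    ("gene_B", "ATGCCGTTAGCATGCA"),
]


def round_trip(records):
    """Write the records to a temporary file and read them back."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "library.fasta")
        write_library(path, records)
        return read_library(path)
```

Any other convenient data storage structure (e.g., a database table) could be substituted; the text file is used here only because it is simple to inspect.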

[0087] By providing the nucleotide sequence in computer readable form, the 
information can be accessed for a variety of purposes. Computer software to access 
sequence information is publicly available. For example, the gapped BLAST (Altschul 
et al., Nucleic Acids Res. (1997) 25:3389-3402) and BLAZE (Brutlag et al., Comp. 
Chem. (1993) 17:203) search algorithms on a Sybase system can be used to identify 
open reading frames (ORFs) within the genome that contain homology to ORFs from 
other organisms. 

[0088] As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 
information of the present invention. The minimum hardware of the computer-based 
systems of the present invention comprises a central processing unit (CPU), input 
means, output means, and data storage means. A skilled artisan can readily appreciate 
that any one of the currently available computer-based systems is suitable for use in the 
present invention. The data storage means can comprise any manufacture comprising a 
recording of the present sequence information as described above, or a memory access 
means that can access such a manufacture. 

[0089] "Search means" refers to one or more programs implemented on the 
computer-based system, to compare a target sequence or target structural motif, or 
expression levels of a polynucleotide in a sample, with the stored sequence information. 
Search means can be used to identify fragments or regions of the genome that match a 
particular target sequence or target motif. A variety of known algorithms are publicly 
known and commercially available, e.g., MacPattern (EMBL), BLASTN and BLASTX 
(NCBI). A "target sequence" can be any polynucleotide or amino acid sequence of six 
or more contiguous nucleotides or two or more amino acids, preferably from about 10 
to 100 amino acids or from about 30 to 300 nucleotides. A variety of comparing means 
can be used to accomplish comparison of sequence information from a sample (e.g., to 
analyze target sequences, target motifs, or relative expression levels) with the data 
storage means. A skilled artisan can readily recognize that any one of the publicly 
available homology search programs can be used as the search means for the 
computer-based systems of the present invention to accomplish comparison of target 
sequences and motifs. Computer programs to analyze expression levels in a sample and in 
controls are also known in the art. 
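
By way of illustration only, a very simple search means can be sketched as an exact-match scan of the stored sequences for a target sequence of six or more nucleotides. This sketch is not BLASTN, BLASTX or MacPattern and does not score partial similarity; the function name, the example library and its sequences are hypothetical.

```python
def find_target(library, target, min_length=6):
    """Return {identifier: [0-based start positions]} of exact matches.

    `library` maps identifiers to stored sequences. Only exact matches
    are reported; a homology search program would also score
    mismatches and gaps.
    """
    if len(target) < min_length:
        raise ValueError("target sequence is shorter than min_length")
    target = target.upper()
    hits = {}
    for identifier, sequence in library.items():
        sequence = sequence.upper()
        positions = []
        start = sequence.find(target)
        while start != -1:
            positions.append(start)
            # Resume one position later so overlapping matches are found.
            start = sequence.find(target, start + 1)
        if positions:
            hits[identifier] = positions
    return hits


# Hypothetical placeholder library, not sequences from Tables 1-3.
EXAMPLE_LIBRARY = {
    "gene_A": "ATGGCTAGCTAGGCTAACGT",
    "gene_B": "ATGCCGTTAGCATGCA",
}
```

For instance, `find_target(EXAMPLE_LIBRARY, "GCTAGC")` reports a single match in `gene_A` and none in `gene_B`.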

[0090] A "target structural motif," or "target motif," refers to any rationally selected 
sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration that is formed upon the folding of the target motif, or 
on consensus sequences of regulatory or active sites. There are a variety of target 
motifs known in the art. Protein target motifs include, but are not limited to, enzyme 
active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, hairpin structures, promoter sequences and other expression elements such as 
binding sites for transcription factors. 

[0091] The present invention encompasses the use of the library of essential and 
important genes to search for polynucleotide and amino acid sequences in common 
among the essential and important genes. Such identified sequences can be used to 
design and develop antibacterial agents and vaccines against Pseudomonas aeruginosa. 

[0092] A variety of structural formats for the input and output means can be used to 
input and output the information in the computer-based systems of the present 
invention. One format for an output means ranks the relative expression levels of 
different polynucleotides. Such presentation provides a skilled artisan with a ranking 
of relative expression levels to determine a gene expression profile. 
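
By way of illustration only, an output means that ranks relative expression levels can be sketched as a sort of measured values. The identifiers and the numerical levels below are hypothetical and are given in arbitrary units.

```python
def rank_expression(levels):
    """Return (identifier, level) pairs ordered from highest to lowest level."""
    return sorted(levels.items(), key=lambda item: item[1], reverse=True)


# Hypothetical relative expression levels (arbitrary units).
EXAMPLE_LEVELS = {"gene_A": 2.5, "gene_B": 0.4, "gene_C": 7.1}
```

The ordered list returned by `rank_expression(EXAMPLE_LEVELS)` is one possible presentation of a gene expression profile.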

[0093] As discussed above, the "library" as used herein also encompasses 
biochemical libraries of the polynucleotides of Tables 1-3, e.g., collections of nucleic 
acids representing the provided polynucleotides. The biochemical libraries can take a 
variety of forms, e.g., a solution of cDNAs, a pattern of probe nucleic acids stably 
associated with a surface of a solid support (i.e., an array) and the like. Of particular 
interest are nucleic acid arrays in which one or more of the sequences of Tables 1-3 is 
represented on the array. By "array" is meant an article of manufacture that has at 
least a substrate with at least two distinct nucleic acid targets on one of its surfaces, 
where the number of distinct nucleic acids can be considerably higher, typically being 
at least 10 nt, usually at least 20 nt and often at least 25 nt. A variety of different array 
formats have been developed and are known to those of skill in the art. The arrays of 
the subject invention find use in a variety of applications, including gene expression 
analysis, drug screening, mutation analysis and the like, as disclosed in the above-listed 
exemplary patent documents. 

[0094] In addition to the above nucleic acid libraries, analogous libraries of 
polypeptides are also provided, where the polypeptides of the library will represent at 
least a portion of the polypeptides encoded by a gene corresponding to one or more of 
the sequences in Tables 1-3. 

[0095] "Identity" as it is used in the present invention should be distinguished from 
"homology" or "homologous." In the context of the coding sequences and genes of this 
invention, "homologous" refers to genes whose expression results in expression 
products which have a combination of amino acid sequence similarity (or base 
sequence similarity for transcript products) and functional equivalence, and are 
therefore homologous genes. In general such genes also have a high level of DNA 
sequence similarity (i.e., greater than 80% identity when such sequences are identified 
among members of the same genus, but lower when these similarities are noted across 
bacterial genera), but are not identical. Relationships across bacterial genera between 
homologous genes are more easily identified at the polypeptide (i.e., the gene product) 
rather than the DNA level. The combination of functional equivalence and sequence 
similarity means that if one gene is useful, e.g., as a target for an antibacterial agent, or 
for screening for such agents, then the homologous gene is probably also useful, but 
may not react in the same manner or to the same degree to the activity of a specific 
antibacterial agent. 

[0096] Nevertheless, the identification of one such gene serves to identify a 
homologous gene through the same relationships as indicated above, and can serve as a 
starting point to determine whether the homologous gene is also essential, whether it 
responds to the same antibacterial agents, etc. Typically, such homologous genes are 
found in other bacterial species, especially, but not restricted to, closely related species. 
Due to the DNA sequence similarity, homologous genes are often identified by 
hybridizing with probes from the initially identified gene under hybridizing conditions 
that allow stable binding under appropriately stringent conditions. For instance, nucleic 
acids having sequence similarity are detected by hybridization under low stringency 
conditions, for example, at 50°C and 10XSSC (0.9 M saline/0.09 M sodium citrate) and 
remain bound when subjected to washing at 55°C in 1XSSC. Sequence identity can be 
determined by hybridization under stringent conditions, for example, at 50°C or higher 
and 0.1XSSC (9 mM saline/0.9 mM sodium citrate). Hybridization methods and 
conditions are well known in the art, see, e.g., USPN 5,707,829. Nucleic acids that are 
substantially identical to the provided polynucleotide sequences, e.g., allelic variants, 
genetically altered versions of the gene, etc., bind to the provided polynucleotide 
sequences under stringent hybridization conditions. By using probes, particularly 
labeled probes of DNA sequences, one can isolate homologous or related or 
substantially identical genes. The equivalent function of the product is then verified 
using appropriate biological and/or biochemical assays. 

[0097] Using such hybridization techniques for the identification of homologous 
genes, it will be possible to screen other species of bacteria, particularly other genera of 
Gram-negative pathogenic bacteria, although Gram-positive bacteria may also be 
screened, to determine if any essential or important gene identified herein has a 
homologue in that particular genus of bacteria. If so, such gene could be cloned and 
isolated for essentiality in the particular genus, and further tested for sensitivity or 
susceptibility to the antibacterial agents and inhibitors identified herein. Specific 
genera of bacteria particularly appropriate for hybridization screening for the presence 
of homologues of essential and important genes include Escherichia, Hemophilus, 
Vibrio, Borrelia, Enterococcus, Helicobacter, Legionella, Mycobacterium, Mycoplasma, 
Neisseria, Staphylococcus, Streptococcus, etc. 

[0098] "Identity," on the other hand, is gauged from the starting point of complete 
homology. Thereafter, identity may be described in terms of percentages according to 
the number of base changes in the DNA sequence taking into account any gaps. For 
purposes of the present invention, variants of the invention have a sequence identity 
greater than at least about 65%, preferably at least about 75%, more preferably at least 
about 85%, and can be greater than at least about 90% or more as determined by the 
Smith-Waterman homology search algorithm as implemented in MPSRCH program 
(Oxford Molecular). A preferred method of calculating percent identity is the 
Smith-Waterman algorithm, using the following. Global DNA sequence identity must be 
greater than 65% as determined by the Smith-Waterman homology search algorithm as 
implemented in MPSRCH program (Oxford Molecular) using an affine gap search with 
the following search parameters: gap open penalty, 12; and gap extension penalty, 1. 
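
By way of illustration only, a Smith-Waterman local alignment with affine gap penalties can be sketched as follows, using the gap open penalty of 12 and gap extension penalty of 1 recited above. This is not the MPSRCH implementation. The match and mismatch scores (+5 and -4) are assumptions, as is the convention that the first position of a gap costs the open penalty and each further position costs the extension penalty. Percent identity is reported here over the aligned region only, which is a further simplifying assumption.

```python
def smith_waterman_identity(a, b, match=5, mismatch=-4,
                            gap_open=12, gap_extend=1):
    """Return (best local score, percent identity over the local alignment).

    Affine gaps: the first gap position costs `gap_open`, each further
    position costs `gap_extend` (an assumed convention).
    """
    n, m = len(a), len(b)
    neg = float("-inf")
    # H: best score of an alignment ending at (i, j).
    # E: best score ending with a gap that consumes b[j-1].
    # F: best score ending with a gap that consumes a[i-1].
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[neg] * (m + 1) for _ in range(n + 1)]
    F = [[neg] * (m + 1) for _ in range(n + 1)]
    best, best_pos = 0, (0, 0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            sub = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(0, H[i - 1][j - 1] + sub, E[i][j], F[i][j])
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the best cell, counting aligned columns and identities.
    i, j = best_pos
    state = "H"
    identical = 0
    aligned = 0
    while i > 0 and j > 0:
        if state == "H":
            if H[i][j] == 0:
                break
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if H[i][j] == H[i - 1][j - 1] + sub:
                if a[i - 1] == b[j - 1]:
                    identical += 1
                aligned += 1
                i, j = i - 1, j - 1
            elif H[i][j] == E[i][j]:
                state = "E"
            else:
                state = "F"
        elif state == "E":
            aligned += 1
            if E[i][j] == H[i][j - 1] - gap_open:
                state = "H"  # this column opened the gap
            j -= 1
        else:
            aligned += 1
            if F[i][j] == H[i - 1][j] - gap_open:
                state = "H"  # this column opened the gap
            i -= 1
    percent = 100.0 * identical / aligned if aligned else 0.0
    return best, percent
```

Under these assumptions, two sequences would meet the 65% threshold recited above when the returned percentage exceeds 65. The sketch runs in time proportional to the product of the two sequence lengths and is intended for short sequences only.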

[0099] Amino acid sequence variants are also included in the invention. Preferably, 
naturally or non-naturally occurring protein variants have amino acid sequences which 
are at least 85%, 90%, or 95% identical to the amino acid sequences identified herein, 
or to a shorter portion of these sequences. More preferably, the molecules are 98% or 
99% identical. Percent sequence identity is determined using the Smith-Waterman 
homology search algorithm using an affine gap search with a gap open penalty of 12 
and a gap extension penalty of 2, BLOSUM matrix of 62. The Smith-Waterman 
homology search algorithm is taught in Smith and Waterman, Adv. Appl. Math. (1981) 
2:482-489. 

[0100] Also included in the invention are fragments of the nucleic acid sequences and 
amino acid sequences identified herein, as well as RNAs and RNA fragments 
corresponding to the DNA sequences disclosed. Such nucleic acid fragments are at 
least about 10 nucleotides, more preferably at least about 20 to 25 nucleotides, and 
more preferably at least about 50 to 100 nucleotides, and can include any fragment or 
variant of a fragment. Such nucleic acid fragments may be used as probes for 
identifying similar or substantially identical or identical nucleic acid sequences in other 
genera, or as tools in constructing nucleic acid vectors for knock out and promoter 
swap experiments. Such amino acid fragments are at least about four amino acids in 
length, more preferably at least about 8 to 12 amino acids in length, and more 
preferably at least about 20 to 30 amino acids in length, and may be used as agonists or 
antagonists to test binding interactions of the proteins disclosed herein, or alternatively 
as immunogens to isolate antibodies that recognize and bind to specific epitopes of a 
target protein. 

[0101] Once a gene is identified as being essential or important for Pseudomonas 
growth on rich media or in any specific environment, the invention also encompasses 
the identification of antibacterial agents that have specific activity against the essential 
or important genes or their gene products or the biochemical pathways in which they 
are involved. In this context, the term "biochemical pathway" refers to a connected 
series of biochemical reactions normally occurring in a cell, or more broadly a cellular 
event such as cellular division or DNA replication. Typically, the steps in such a 
biochemical pathway act in a coordinated fashion to produce a specific product or 
products or to produce some other particular biochemical action. Such a biochemical 
pathway requires the expression product of a gene if the absence of that expression 
product either directly or indirectly prevents the completion of one or more steps in that 
pathway, thereby preventing or significantly reducing the production of one or more 
normal products or effects of that pathway. 

[0102] Thus, an agent specifically inhibits such a biochemical pathway requiring the 
expression product of a particular gene if the presence of the agent stops or 
substantially reduces the completion of the series of steps in that pathway. Such an 
agent may, but does not necessarily, act directly on the expression product of that 
particular gene. An "expression product" of a gene means that, in a bacterial cell of 
interest, the gene is transcribed to form RNA molecules. For those genes that are 
transcribed into mRNAs, the mRNA is translated to form polypeptides. More 
generally, in this context, "expressed" means that a gene product is formed at the 
biological level that would normally have the relevant biological activity (i.e., RNA or 
polypeptide level). 

[0103] Thus, the invention includes a method of screening for an antibacterial agent, 
comprising determining whether a test compound is active against an essential or 
important bacterial gene identified by the methods herein. The invention also includes 
a method of screening for an antibacterial agent, comprising determining whether a test 
compound is active against a protein encoded by an essential bacterial gene identified 
herein, or active to inhibit the biochemical pathway that involves said protein. The 
term "antibacterial agent" refers to both naturally occurring antibiotics produced by 
microorganisms to suppress the growth of other microorganisms, and agents 
synthesized or modified in the laboratory which have either bactericidal or 
bacteriostatic activity. An "active" agent in this context will inhibit the growth of P. 
aeruginosa and possibly related species. The term "inhibiting the growth" indicates 
that the rate of increase in the numbers of a population of a particular bacterium is 
reduced. Thus, the term includes situations in which the bacterial population increases 
but at a reduced rate, as well as situations where the growth of the population is 
stopped, as well as situations where the numbers of the bacteria in the population are 
reduced or the population even eliminated. If an enzyme activity assay is used to 
screen for inhibitors, one can make modifications in uptake/efflux, solubility, half life, 
etc. to compounds in order to correlate enzyme inhibition with growth inhibition. 
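
By way of illustration only, the three situations covered by the term "inhibiting the growth" can be sketched numerically. The sketch assumes exponential growth between two population counts; the function names, the tolerance and the category labels are hypothetical.

```python
import math


def growth_rate(n_start, n_end, hours):
    """Exponential growth rate per hour estimated from two population counts."""
    return math.log(n_end / n_start) / hours


def classify_inhibition(control_rate, treated_rate, tolerance=1e-9):
    """Label the effect of an agent by comparing treated and control rates."""
    if treated_rate >= control_rate:
        return "not inhibited"
    if abs(treated_rate) <= tolerance:
        return "growth stopped"
    if treated_rate > 0:
        return "growth slowed"
    return "population reduced"
```

For instance, a control culture growing from 1e6 to 8e6 cells in three hours and a treated culture growing from 1e6 to 2e6 cells in the same period would be labeled "growth slowed".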

[0104] Assays may include any suitable method and may be expected to vary depending 
on the type of essential gene or protein involved. For instance, one embodiment is a 
method comprising the steps of: 

a) contacting said protein or a biologically active fragment thereof with a test 
compound; and 

b) determining whether said test compound binds to said essential gene product 
or protein or fragment of said protein; 

wherein binding of said test compound to said polypeptide or said fragment is 
indicative that said test compound is an antibacterial agent. It is quite common 
in identifying antibacterial agents, to assay for binding of a compound to a 
particular polypeptide where binding is an indication of a compound which is 
active to modulate the activity of the polypeptide. Binding may be determined 
by any means according to the agent tested and techniques known in the art. 

[0105] Also, agents that inhibit binding of two proteins or polypeptides may also be 
identified, for instance using a yeast two-hybrid system. Such a system will entail 
cloning the genes encoding each protein and expressing each in a reporter cell system 
such that interaction between the two proteins is monitored by observing the expression 
of a reporter gene. For instance, cDNAs cloned in a yeast two-hybrid expression 
system (Chien et al. (1991) Proc. Natl. Acad. Sci. (U.S.A.) 88: 9578; Zervos et al. 
(1993) Cell 72: 233) can be used to identify other cDNAs encoding proteins that 
interact with the protein encoded by the first, thereby producing expression of the 
GAL4-dependent reporter gene. Thereafter, cells expressing both proteins leading to 
expression of the reporter gene are used to screen for agents that interact with either 
protein, or the gene encoding either protein. Such systems are well known in the art 
and are well within the realm of ordinary skill. 

[0106] Another embodiment is a method for evaluating a test agent for inhibition of 
expression of an essential gene identified according to the methods herein, comprising: 

a) contacting a cell expressing said essential gene with said agent; and 

b) determining the amount or level of expression of said essential gene in said 
sample. 

[0107] The exact determination method will be expected to vary depending on the 
characteristics of the expression product as would be readily apparent to one of 
ordinary skill in the art. Such methods can include, for example, antibody binding 
methods, enzymatic activity determinations, and substrate analog binding assays. Such 
level of expression could be monitored by monitoring the level of the product of the 
essential gene in the cell, i.e., by SDS-PAGE, or by colorimetric assays using, for 
example, a lacZ gene or protein fusion and detection on media using X-Gal or 
spectrophotometric detection. 

[0108] When such fusions are employed, fusions may be designed using the 
chromosomal gene so long as the fusion does not disrupt the function of the essential 
gene, i.e., as with a gene fusion where lacZ is inserted just downstream of the essential 
gene and is expressed from the same promoter as the essential gene. Alternatively, one 
could employ an extrachromosomal fusion construct whereby the wild type 
chromosomal copy of the gene is not disrupted. In this case, one could employ a 
protein fusion, i.e., where a portion of lacZ sufficient to be detected with a colorimetric 
test is fused in frame with the coding region of the essential gene such that a fusion 
protein is obtained. Other detectable or measurable proteins commonly used in the art 
may be used as an alternative to lacZ, for instance, phoA, Lux/luciferase, etc. 

[0109] Another method of the invention for evaluating a potential antibacterial 
agent comprises the steps of: 

a) providing a bacterial strain comprising a mutant or normal form of the 
essential or important gene, wherein said mutant form of the gene confers a 
growth conditional phenotype; 

b) contacting bacteria of said bacterial strain with a test compound in semi- 
permissive or permissive growth conditions; and 

c) determining whether the growth of said bacterial strain comprising said 
mutant form of a gene is reduced in the presence of said test compound to a 
greater extent than a comparison bacteria comprising a normal form of said 
gene. 
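
By way of illustration only, the comparison in step c) can be sketched as a comparison of fractional growth reductions in the mutant strain and in the comparison strain. The growth values could be, for example, optical density readings; the function names, the `margin` parameter and the example values are hypothetical.

```python
def fractional_reduction(untreated_growth, treated_growth):
    """Fraction by which growth falls in the presence of the test compound."""
    return 1.0 - treated_growth / untreated_growth


def is_hypersensitive(mutant_untreated, mutant_treated,
                      normal_untreated, normal_treated, margin=0.0):
    """True if the mutant's growth is reduced to a greater extent than the
    comparison strain's, by more than `margin` (a hypothetical threshold)."""
    mutant_drop = fractional_reduction(mutant_untreated, mutant_treated)
    normal_drop = fractional_reduction(normal_untreated, normal_treated)
    return mutant_drop > normal_drop + margin
```

For instance, a mutant whose growth measure falls from 0.8 to 0.2 while the comparison strain falls only from 1.0 to 0.9 would be flagged by `is_hypersensitive(0.8, 0.2, 1.0, 0.9)`. Normalizing each strain to its own untreated growth is a design choice that accounts for the mutant growing more slowly than the normal strain even without the compound.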

[0110] In this context, a "mutant form" of a gene is a gene which has been altered, 
either naturally or artificially, changing the base sequence of the gene, which results in 
a change in the amino acid sequence of an encoded polypeptide. The change in the base 
sequence may be of several different types, including changes of one or more bases for 
different bases, small deletions, and small insertions. Mutations may also include 
transposon insertions that lead to attenuated activity, i.e., by resulting in expression of a 
truncated protein. By contrast, a normal form of a gene is a form commonly found in a 
natural population of a bacterial strain. Commonly a single form of a gene will 
predominate in natural populations. In general, such a gene is suitable as a normal 
form of a gene, however, other forms which provide similar functional characteristics 
may also be used as a normal gene. In particular, a normal form of a gene does not 
confer a growth conditional phenotype on the bacterial strain having that gene, while a 
mutant form of a gene suitable for use in these methods does provide such a growth 
conditional phenotype. 

[0111] As used in the present disclosure, the term "growth conditional phenotype" 
indicates that a bacterial strain having such a phenotype exhibits a significantly greater 
difference in growth rates in response to a change in one or more of the culture 
parameters than an otherwise similar strain not having a growth conditional phenotype. 
Typically, a growth conditional phenotype is described with respect to a single growth 
culture parameter, such as temperature. Thus, a temperature (or heat-sensitive) mutant 
(i.e., a bacterial strain having a heat-sensitive phenotype) exhibits significantly reduced 
growth, and preferably no growth, under non-permissive temperature conditions as 
compared to growth under permissive conditions. In addition, such mutants preferably 
also show intermediate growth rates at intermediate, or semi-permissive, temperatures. 
Similar responses also result from the appropriate growth changes for other types of 
growth conditional phenotypes. A growth conditional phenotype can also be conferred 
by cloning an essential or important gene behind a regulatable promoter, for instance, a 
promoter that is only active, or only leads to transcription, under particular 
environmental conditions or in response to a specific environmental stimulus. Such 
growth conditional promoter mutants may be isolated according to the promoter swap 
strategies described herein. 

[0112] "Semi-permissive conditions" are conditions in which the relevant culture 
parameter for a particular growth conditional phenotype is intermediate between 
permissive conditions and non-permissive conditions. Consequently, in 
semi-permissive conditions the bacteria having a growth conditional phenotype will 
exhibit growth rates intermediate between those shown in permissive conditions and 
non-permissive conditions. In general, such intermediate growth rate is due to a mutant 
cellular component which is partially functional under semi-permissive conditions, 
essentially fully functional under permissive conditions, and is non-functional or has 
very low function under non-permissive conditions, where the level of function of that 
component is related to the growth rate of the bacteria. 

[0113] The term "method of screening" means that the method is suitable, and is 
typically used, for testing for a particular property or effect in a large number of 
compounds. Therefore, the method requires only a small amount of time for each 
compound tested; typically more than one compound may be tested simultaneously (as 
in a 96-well microtiter plate, or in a series of replica plates), and preferably significant 
portions of the procedure can be automated. "Method of screening" also refers to 
determining a set of different properties or effects of one compound simultaneously. 

[0114] Because the essential and important genes identified herein can be readily 
isolated and the genes cloned into a variety of vectors known in the art, the invention 
also encompasses vectors comprising the nucleic acid sequences, open reading frames 
and genes of the invention, as well as host cells containing such vectors. Because the 
essential genes identified herein can be readily isolated and the encoded gene products 
expressed by routine methods, the invention also provides the polypeptides encoded by 
those genes, as well as genes having at least about 50%, or more preferably about 60%, 
or more preferably about 70%, or more preferably about 80%, or more preferably about 
90%, or most preferably about 95% protein sequence identity. 

[0115] Thus, by identifying certain essential and/or important genes, this invention 
provides a method of screening for an antibacterial agent by contacting a polypeptide 
encoded by one of the identified essential or important genes, or a biologically active 
fragment of such a polypeptide, with a test compound, and determining whether the test 
compound binds to the polypeptide or polypeptide fragment. In addition to simple 
binding determinations, the invention provides a method for identifying or evaluating 
an agent active on one of the identified essential genes. The method involves contacting 
a sample containing an expression product of one of the identified genes with the 
known or potential agent, and determining the amount or level of activity of the 
expression product in the sample. 

[0116] In particular, antibodies to essential and important gene products are 
anticipated to be suitable diagnostic binding and antibacterial agents. Thus, antibodies 
to the proteins encoded by the essential and important genes identified by the methods 
described herein are also included in the invention. Such antibodies may be isolated 
according to well known techniques in the art, i.e., Kohler and Milstein for monoclonal 
antibodies. Also included are polyclonal antibodies and antibody fragments such as Fv, 
Fab and Fab2 fragments, as well as chimeric and humanized antibodies, and human 
antibodies, i.e., made using a Xeno mouse. 

[0117] In a further aspect, this invention provides a method of diagnosing the 
presence of a bacterial strain having one of the genes identified above, by probing with 
an oligonucleotide at least 15 nucleotides in length, which specifically hybridizes to a 
nucleotide sequence which is the same as or complementary to the sequence of one of 
the bacterial genes identified above. In some cases, it is practical to detect the presence 
of a particular bacterial strain by direct hybridization of a labeled oligonucleotide to the 
particular gene. In other cases, it is preferable to first amplify the gene or a portion of 
the gene before hybridizing labeled oligonucleotides to those amplified copies. 
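
By way of illustration only, the selection of an oligonucleotide whose sequence is the same as or complementary to part of a gene can be sketched as an exact-match check against both strands. Exact matching is used here only as a simple stand-in for specific hybridization, which in practice tolerates some mismatch depending on conditions. The function names and the example sequence are hypothetical and are not taken from Tables 1-3.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.upper().translate(_COMPLEMENT)[::-1]


def find_all(needle, haystack):
    """Return every 0-based start position of needle in haystack."""
    positions = []
    start = haystack.find(needle)
    while start != -1:
        positions.append(start)
        start = haystack.find(needle, start + 1)
    return positions


def probe_sites(probe, gene, min_length=15):
    """Locate a probe on a gene sequence, on either strand.

    "same": positions where the probe equals the given strand.
    "complementary": positions where the probe equals the opposite strand,
    reported as coordinates on the given strand.
    """
    if len(probe) < min_length:
        raise ValueError("probe is shorter than min_length")
    probe = probe.upper()
    gene = gene.upper()
    return {
        "same": find_all(probe, gene),
        "complementary": find_all(reverse_complement(probe), gene),
    }


# Hypothetical placeholder sequence, not a sequence from Tables 1-3.
EXAMPLE_GENE = "ATGGCTAGCTAGGCTAACGTTTGACC"
```

For instance, a 17-nucleotide probe taken from positions 3 to 19 of `EXAMPLE_GENE` is found under "same", while its reverse complement is found under "complementary" at the same coordinates.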

[0118] In a related aspect, this invention provides a method of diagnosing the 
presence of a bacterial strain by specifically detecting the presence of the 
transcriptional or translational product of the gene. Typically, a transcriptional (RNA) 
product is detected by hybridizing a labeled RNA or DNA probe to the transcript. 
Detection of a specific translational (protein) product can be performed by a variety of 
different tests depending on the specific protein product. Examples would be binding of 
the product by specific labeled antibodies and, in some cases, detection of a specific 
reaction involving the protein product. Diagnostic assays find particular use in assaying 
tissue and fluid samples of patients suspected of having a Pseudomonas infection. 

[0119] Antibacterial agents identified according to the methods of the invention may 
be employed in pharmaceutical compositions. Such compositions may be administered 
to patients in order to treat an infection by or involving P. aeruginosa, either alone or in 
combination with secondary agents targeted at, for instance, virulence factors of P. 
aeruginosa, or other bacteria that may be present in addition to P. aeruginosa. In this 
context, the term "administration" or "administering" refers to a method of giving a 
dosage of an antibacterial pharmaceutical composition to a mammal, where the method 
is, e.g., topical, oral, intranasal, inhaled, intravenous, transdermal, intraperitoneal, or 
intramuscular. The preferred method of administration can vary depending on various 
factors, e.g., the components of the pharmaceutical composition, the site of the 
potential or actual bacterial infection, the bacterium involved, and the severity of an 
actual bacterial infection. 

[0120] As used above and throughout this application, "hybridize" has its usual 
meaning from molecular biology. It refers to the formation of a base-paired interaction 
between nucleotide polymers. The presence of base pairing implies that at least an 
appreciable fraction of the nucleotides in each of two nucleotide sequences are 
complementary to the other according to the usual base pairing rules. The exact 
fraction of the nucleotides which must be complementary in order to obtain stable 
hybridization will vary with a number of factors, including nucleotide sequence, salt 
concentration of the solution, temperature, and pH. 

[0121] The term, "DNA molecule", should be understood to refer to a linear polymer 
of deoxyribonucleotides, as well as to the linear polymer, base-paired with its 
complementary strand, forming double-strand DNA (dsDNA). The term is used as 
equivalent to "DNA chain" or "a DNA" or "DNA polymer" or "DNA sequence", so this 
description of the term meaning applies to those terms also. The term does not 
necessarily imply that the specified "DNA molecule" is a discrete entity with no 
bonding with other entities. The specified DNA molecule may have H-bonding 
interactions with other DNA molecules, as well as a variety of interactions with other 
molecules, including RNA molecules. In addition, the specified DNA molecule may be 
covalently linked in a longer DNA chain at one, or both ends. Any such DNA 
molecule can be identified in a variety of ways, including, by its particular nucleotide 
sequence, by its ability to base pair under stringent conditions with another DNA or 
RNA molecule having a specified sequence, or by a method of isolation which includes 
hybridization under stringent conditions with another DNA or RNA molecule having a 
specified sequence. 

[0122] References to a "portion" of a DNA or RNA chain mean a linear chain which 
has a nucleotide sequence which is the same as a sequential subset of the sequence of 
the chain to which the portion refers. Such a subset may contain all of the sequence of 
the primary chain or may contain only a shorter sequence. The subset will contain at 
least 15 bases in a single strand. However, by "same" is meant "substantially the 
same"; deletions, additions, or substitutions of specific nucleotides of the sequence, or a 
combination of these changes, which affect a small percentage of the full sequence will 
still leave the sequences substantially the same. Preferably this percentage of change 
will be less than 20%, more preferably less than 10%, and even more preferably less 
than 3%. "Same" is therefore distinguished from "identical"; for identical sequences 
there cannot be any difference in nucleotide sequences. 

[0123] As used in reference to nucleotide sequences, "complementary" has its usual 
meaning from molecular biology. Two nucleotide sequences or strands are 
complementary if they have sequences that would allow base pairing between the 
strands according to the usual pairing rules. This does not require that the strands would 
necessarily base pair at every nucleotide; two sequences can still be complementary 
with a low level of base mismatch such as that created by deletion, addition, or 
substitution of one or a few (up to 5 in a linear chain of 25 bases) nucleotides, or a 
combination of such changes. 

[0124] Other embodiments of the invention will be immediately envisaged by those 
of skill in the art upon reading the methods and examples to follow. Such examples are 
merely illustrative of the invention, and should not be construed as limiting the scope of 
the invention in any way. 

Methodology 

Generation of Transposon Library 

[0125] Transposon insertions were generated using an improved transposon system 
for P. aeruginosa that utilizes a mini-Tn5-type transposon on a delivery vector that 
does not replicate in Pseudomonas. The delivery vector contains a modified 
transposase gene with three amino acid substitutions that have been shown to increase 
the frequency of Tn5 insertions. Weinreich et al., 1994, Evidence that cis preference of 
the Tn5 transposase is caused by nonproductive multimerization. Genes Dev. 8(19): 
2363-74. The Tn5 transposase was placed under control of a lac promoter and the 
complete transposable element was minimized to 1.7 kilobases in length, including a 
tetracycline resistance marker and transcription terminator to prevent read-through into 
the genome. The transposon vector is delivered to P. aeruginosa via conjugation from 
a suitable E. coli host (e.g., SM10λpir). Following conjugation, transposon mutants are 
selected by resistance to tetracycline conferred by the transposable element. 

[0126] Libraries were created in both P. aeruginosa PAK and PAO1. The average 
diversity of the libraries created using this strategy is estimated to be ~40,000 to 
~50,000 independent mutants per conjugation. Care is taken to minimize passage of 
each transposon conjugation before plating for mutant selection in an effort to minimize 
the potential for siblings, i.e., by stopping the conjugation after sufficient time for a 
single round of conjugation events. 

High-Throughput Transposon Insertion Mapping (HTTIM) 

[0127] Precise transposon insertion sites were determined by an anchored, semi-random 
PCR method for amplification of the transposon/genome junction region. 
O'Toole and Kolter, 1998, Initiation of biofilm formation in Pseudomonas fluorescens 
WCS365 proceeds via multiple, convergent signaling pathways: a genetic analysis, 
Mol. Microbiol. 28(3): 449-61. The technique, HTTIM, uses both Tn5 specific and 
semi-random primers with conserved primer tails. A small aliquot of transposon 
mutant liquid culture is used as a template and amplification of a fragment containing 
an insertion site is achieved in a two-step process. The PCR product is then sequenced 
and the insertion site is entered into an Oracle database for analysis. To date, more than 
10,000 to 14,000 insertions have been mapped, each insertion representing the 
disruption of a gene or intergenic region that is not essential for survival on rich media. 
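
The bookkeeping described above, assigning each mapped insertion site to the ORF that contains it, can be sketched as follows. This is only an illustrative sketch: the function name, the tuple layout, the toy coordinates, and the assumption of non-overlapping ORFs are all hypothetical and are not taken from the database actually used.

```python
from bisect import bisect_right
from collections import Counter


def count_insertions_per_orf(orfs, insertion_positions):
    """Count mapped insertions falling inside each ORF.

    orfs: list of (gene_id, start, end) with inclusive coordinates,
          assumed non-overlapping (a simplification for this sketch).
    insertion_positions: iterable of genome coordinates of insertion sites.
    Returns {gene_id: count}; intergenic insertions are ignored.
    """
    ordered = sorted(orfs, key=lambda orf: orf[1])
    starts = [start for _, start, _ in ordered]
    counts = Counter({gene_id: 0 for gene_id, _, _ in ordered})
    for pos in insertion_positions:
        # Index of the last ORF whose start is <= pos.
        idx = bisect_right(starts, pos) - 1
        if idx >= 0:
            gene_id, start, end = ordered[idx]
            if start <= pos <= end:
                counts[gene_id] += 1
    return dict(counts)


# Hypothetical toy genome: three ORFs and five mapped insertion sites.
orfs = [("orfA", 100, 400), ("orfB", 500, 1700), ("orfC", 1800, 2000)]
hits = count_insertions_per_orf(orfs, [150, 450, 600, 1650, 2500])

# ORFs without any insertion are the candidates for essentiality.
candidates = [gene_id for gene_id, count in hits.items() if count == 0]
```

ORFs that end up with a count of zero form the candidate set that the later statistical analysis refines.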

[0128] With every insertion added to the map, the regions of the genome containing 
essential genes, and particularly those containing operons containing essential genes 
(because of potential polar effects of insertions in upstream genes), begin to become 
apparent because these regions will not be able to accommodate transposon insertions. 
Table 1 shows a listing of the open reading frames identified as existing between 
transposon insertions, as well as an indication of whether the gene has homologues that 
have been identified in other bacteria pursuant to BLAST sequence database analysis. 
Open reading frames were tentatively assigned names prior to being identified pursuant 
to HTTIM analysis, as disclosed in the Pseudomonas genome project, and reported in 
Stover et al., Complete genome sequence of Pseudomonas aeruginosa PAO1, an 
opportunistic pathogen, August 21, 2000, Nature 406: 959-964, herein incorporated by 
reference in its entirety. 

[0128] For instance, the predicted ORFs were examined individually for (1) identity 
with known genes of P. aeruginosa with sequences deposited in GenBank, (2) 
similarity with well-characterized genes from other bacteria, or (3) presence of known 
functional motifs (see http://www.pseudomonas.com for complete list). In each case 
the literature was searched to ensure that the proteins encoded by the homologous 
genes were functionally characterized to avoid the perpetuation of poorly supported 
functional assignments. In addition, 61 researchers who were members of the P. 
aeruginosa research community or had experience in particular aspects of bacterial 
physiology were enlisted for the Pseudomonas Community Annotation Project 
(PseudoCAP) to provide expert assistance and confirmatory information in the genome 
project for the analysis of identified ORFs and assigned functions. 

[0129] The genome project was able to assign a functional class to 54.2% of ORFs. 
As in other bacterial genomes, a large proportion of the genome (45.8% of ORFs) 
consists of genes for which no function could be determined or proposed (confidence 
level 4). Of these, nearly a third (769 ORFs) possess homology to genes of unknown 
function predicted in other bacterial genomes, and the remainder (32% of ORFs) do not 
have strong homology with any reported sequence. The 372 ORFs from the entire 
genome analysis that are known P. aeruginosa genes with demonstrated functions 
(confidence level 1) are primarily genes encoding lipopolysaccharide biosynthetic 
enzymes, virulence factors, such as exoenzymes and the systems that secrete them, and 
proteins involved in motility and adhesion. ORFs with strong homology to genes in 
other organisms with demonstrated functions (confidence level 2; 1,059 ORFs) include 
those required for DNA replication, protein synthesis, cell-wall biosynthesis and 
intermediary metabolism. 

[0130] The ORFs that provided the most new information about P. aeruginosa 
biology via the genome annotation were those that could be assigned a probable 
function on the basis of similarity to established sequence motifs, but could not be 
assigned a definite name (confidence level 3; 1,590 ORFs). Most of these genes 
encode products that are in one of three functional classes: putative enzymes (405 
genes), transcriptional regulators (341 genes) or transporters of small molecules (408 
genes). In some cases genomic context provided additional information, allowing us to 
identify loci that appear to encode systems such as metabolic pathways and secretion 
systems, although the substrates for such systems could not be identified. The system 
for assigning name and putative function to each essential or important gene was 
gleaned from the Pseudomonas genome project data already available. 

Statistical Analysis of Putative Essential and Important Genes 

[0131] The open reading frames listed in Table 1 are also presented in Table 2, 
wherein the ORFs are listed in order of length in base pairs from longest to shortest. 
Also listed in Table 2 is the probability of essentiality assigned to each of the open 
reading frames. Probability correlates with length of the ORF, such that the longer the 
ORF, the higher the probability of hitting the ORF in a random transposon mutagenesis 
experiment, and the higher the confidence level that the ORF represents an essential or 
an important gene given that no transposon insertions therein were isolated. Statistical 
confidence levels in essentiality or importance can help narrow the focus in the 
screening of specific genes, thereby shortening the verification process and the 
subsequent identification of antibacterial agents specific for that gene or gene product. 
Thus, one of the benefits of the HTTIM approach is that it is a quantitative approach 
that lends itself well to statistical analysis. 

[0132] The High-Throughput Transposon Insertion Mapping (HTTIM) strategy 
utilizes a transposon, which is a small, mobile DNA element that randomly inserts into 
the chromosome. Although HTTIM was performed using a Tn5 transposon, any 
transposon may be employed so long as its insertion into the chromosome is random, 
i.e., devoid of hot spots. Reznikoff, W.S., 1993, The Tn5 transposon, Annu. Rev. 
Microbiol. 47: 945-63. Although the Tn5 derivative employed here contained a 
modified transposase gene with three amino acid substitutions that have been shown to 
increase the frequency of Tn5 insertions (see supra), the frequency of insertion is 
generally quite low. For instance, mutants with even one insertion occur at a rate of 
only 1 in 10^ or 10^ bacteria, and must be specifically selected from a background of 
cells with no insertions. Because the frequency of a single insertion is so low, the 
frequency of a double insertion is so low as to be insignificant. 

[0133] When the transposon insertion disrupts one of the 5570 genes in the 
Pseudomonas genome, the function of that gene is lost. If the disrupted gene is 
essential for growth, the transposon insertion mutant dies and cannot be characterized. 
If the transposon disrupts a gene that is non-essential, the mutant survives and grows, and 
the transposon insertion site is mapped. By examining the insertion sites of a large 
number of transposon mutants, all of the non-essential P. aeruginosa genes can be 
identified, and by implication, all of the essential genes may be identified as well. 
Characterization of over 13,000 transposon insertions revealed insertions in 3890 genes 
and resulted in an even distribution of insertions across the entire length of the genome. 
The remaining 1658 genes, in which a transposon insertion has never been observed, 
are candidate essential genes (30%). See Figure 7, showing a graph illustrating 
ORF coverage by Tn5 achieved in High-Throughput Transposon Insertion Mapping 
(HTTIM), wherein 30% of the genes in the genome are candidate essential genes where 
ORF size is not taken into account in predicting essentiality. 

[0134] Because insertion of the transposon used here into the chromosome was 
proposed to be random, it was possible that some of the 1658 genes that did not receive 
a transposon insertion were simply not hit by random chance. One cannot truly know 
that a transposon has no hot spots and is entirely random until the data is analyzed, and 
the data here confirmed that the Tn5 derivative employed underwent random insertion 
in P. aeruginosa. Thus, the chance that a gene will not be hit by the transposon as a 
matter of random chance increases as the length of the gene decreases, particularly for 
very small genes (< 600 base pairs). See Figure 8, Probability of Being an Essential 
Gene Given No Hit. Thus, by deleting smaller ORFs (< 600 base pairs) in which there 
is a lower confidence in essentiality, the probability of essentiality goes up while the 
number of predicted essential genes decreases. Further, the curve in the graph depicted 
in Figure 8 should level off faster. Thus, in predicting the essentiality of genes from 
the HTTIM candidate set, the closer one can come to a probability of 1.0 as depicted in 
Figure 8, the higher the confidence level of essentiality that can be assigned to each 
gene in the candidate subset. For a representation of the number of ORFs of various 
lengths in P. aeruginosa, see the histogram in Figure 9. 
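
The dependence of confidence on ORF length can be illustrated with a short Bayes-rule calculation. The sketch below assumes the Poisson insertion model of Example 1 (an insertion count with mean equal to a per-base-pair rate times the ORF length) and a length-independent prior fraction of nonessential genes. The function name and both parameter values are hypothetical placeholders chosen for illustration, not fitted estimates from the data.

```python
import math


def prob_essential_given_no_hit(length_bp, rate_per_bp, frac_nonessential):
    """P(essential | zero insertions observed) for an ORF of given length.

    A nonessential ORF is missed by chance with probability
    exp(-rate_per_bp * length_bp); an essential ORF is never hit.
    """
    p_miss = math.exp(-rate_per_bp * length_bp)
    prior_essential = 1.0 - frac_nonessential
    return prior_essential / (prior_essential + frac_nonessential * p_miss)


# Hypothetical parameter values, for illustration only.
RATE_PER_BP = 0.002
FRAC_NONESSENTIAL = 0.83

for length in (200, 600, 1500, 3000):
    p = prob_essential_given_no_hit(length, RATE_PER_BP, FRAC_NONESSENTIAL)
    print(f"{length:5d} bp: P(essential | no hit) = {p:.3f}")
```

Under these assumptions the probability starts at the prior fraction of essential genes for very short ORFs and rises toward 1 as length increases, which is the qualitative behavior described for Figure 8.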



[0135] A Bayesian statistical model for truncated counting data was applied to the 
candidate essential gene set, and permitted a determination that 16 to 17 percent of P. 
aeruginosa genes are essential. Such a model may therefore be utilized to increase the 
statistical confidence that a given gene in the candidate subset is essential. An 
exemplary statistical model is provided in Example 1. 

Physical Methods for Target Gene Validation 

[0136] While the above methodology and the database of putative essential and 
important gene candidates established thereby is believed to be superior to existing 
methods with regard to the quantity of experimentation required to identify essential 
and important genes in Pseudomonas aeruginosa and the degree of confidence 
conferred, it should be understood that the methodology described herein can be 
incorporated into combined protocols with technology known in the art. For instance, 
the methods for verifying essentiality disclosed in WO 01/07651, herein incorporated by 
reference in its entirety, would be useful as a secondary method to be utilized in 
combination with the methods described in this disclosure. Alternatively or 
additionally, one of several approaches may be used to determine whether a particular 
gene is essential (absolutely required for survival on rich medium) or important (the 
absence of which results in attenuated growth) to P. aeruginosa. 

Integration Knockouts 

[0137] This is the simplest and most rapid strategy. PCR is used to amplify a small 
(200-500 base pairs) portion of the coding sequence, or open reading frame (ORF), of 
the gene of interest. This gene fragment must be centrally located within the ORF; it 
cannot include either terminus of the gene's coding region. This fragment is cloned into a 
plasmid vector that can replicate in E. coli, but not in Pseudomonas. The vector used 
should have a drug resistance marker that is suitable for selection in Pseudomonas, and 
an origin for conjugal transfer. This feature allows the plasmid to be transferred by 
conjugation from a suitable E. coli donor strain to a Pseudomonas strain when the two 
are co-cultured under the appropriate conditions. 

[0138] Following conjugation the co-cultured mixture is harvested and plated on 
media which selects against the E. coli donor and for Pseudomonas which contain the 
plasmid. Since the plasmid is incapable of extra-chromosomal replication in 
Pseudomonas, colonies that arise are the result of homologous recombination between 
the Pseudomonas chromosome and the cloned gene fragment on the plasmid. This is 
referred to as single-crossover recombination; a single recombination event takes place 
between the plasmid and the chromosome. The result is integration of the plasmid into 
the bacterial chromosome and disruption of the gene from which the fragment was 
amplified (Fig. 1). 

[0139] Variations of this approach are possible. For instance, one could clone out the 
entire locus and isolate transposon insertion mutants in E. coli using known techniques, 
i.e., by transposition from the E. coli genome, selecting plasmid insertions by 
mobilizing the vector into a recipient cell that does not contain the transposon or the 
antibiotic resistance marker encoded by the transposon, and screening the plasmid for 
insertions in the cloned gene. Thereafter, a similar assay could be performed by 
screening for double crossover events in P. aeruginosa that result in recombination of 
the transposon into the chromosomal locus from a suicide vector. 

[0140] Integration of the plasmid or other insertion at the locus can be confirmed by a 
relatively rapid PCR-based screen of recombinant colonies. The advantage of this 
strategy, particularly the plasmid single crossover strategy, is that it requires only 
amplification of a short stretch of DNA followed by a single cloning step before 
recombination experiments can be performed. The disadvantage is that if the target 
gene is essential, no recombinants can be obtained, and failure to obtain recombinants 
is by itself weak evidence of essentiality. However, if a gene is in fact non-essential, 
this method will demonstrate that quickly. 

Integration Knockouts with Extra-chromosomal Complementation 

[0141] This variation of the above method provides more convincing data when the 
target gene is essential. It employs the same type of non-replicating integration plasmid 
described above, but recombinations are performed in strains already carrying a second 
copy of the target gene on an extra-chromosomal plasmid. This second copy can then 
supply the essential function when the chromosomal copy is disrupted. If disruptions 
can only be obtained when a complementing plasmid is present and not when a control 
plasmid is present, this is rather strong evidence that the target gene is essential. The 
advantage of this method is that colonies are obtained even when the target gene is essential. 
The disadvantage is that construction and sequencing of the complementation plasmid 
takes additional time. 

Integration with a Regulatable Promoter (Promoter Swap) 

[0142] This approach also involves selecting for chromosomal integration of non- 
replicating plasmids via homologous recombination. However, the design of the 
integrating plasmid is different. In this case, the N-terminal coding sequence (300-500 
base pairs) of the target gene is PCR amplified and cloned into a vector downstream of 
a regulatable promoter, e.g., a lac promoter, which is inducible in the presence of IPTG, 
or an arabinose promoter (pBAD), inducible in the presence of arabinose. The activity 
of the promoter can be modulated by the presence of a specific inducer molecule. The 
plasmid is conjugated into Pseudomonas and integration selected for under conditions 
where the regulatable promoter is active. The resulting chromosomal integration 
replaces the target gene's natural promoter with the regulatable promoter from the 
plasmid (Fig. 2). If the target gene is essential, recombinants can only survive when 
the inducer molecule is present in their growth media to stimulate gene expression. If 
the gene is non-essential, the recombinant's growth is independent of the addition of the 
inducer. The advantage of this strategy is that it requires only amplification of a short 
stretch of DNA followed by a single cloning step before recombination experiments 
can be performed. 

Examples: Essential Genes Identified 

Example 1: A Bayesian Statistical Model for Increasing Statistical Confidence of 
Essentiality 

[0143] When the Tn5 transposon inserts into the Pseudomonas DNA, one of three 
things happens: 1) The insertion disrupts a nonessential gene. The cell survives to be 
characterized and the location of the insertion is determined. 2) The insertion disrupts 
an essential gene. The cell does not survive and the insertion site is not determined. 3) 
The insertion is in an intergenic region (between genes) and no information is gained. 
Genes with identified insertions are nonessential genes. However, genes without 
identified insertions could be essential genes or nonessential genes with zero 
transposon insertions. To determine the number of essential genes, we have developed a 
multivariate Bayesian model for truncated Poisson data and applied it to the 
Pseudomonas genome data set. A likelihood gain based searching algorithm was 
developed to obtain maximum likelihood estimates. The properties of the algorithm were 
studied. Multivariate and univariate approaches were compared. 

A. Structure of the Data and Preliminary Considerations 

[0144] A transposon Tn5 insertion mutagenesis library was constructed in 
Pseudomonas aeruginosa strains PAK and PAO1. Mutants were randomly picked and 
their genomic insertion site sequence determined through polymerase chain reaction 
(PCR) and automated DNA sequencing. BLASTN analysis of transposon/genome 
junction sequences was used to map the location of the insertions relative to the 
completed strain PAO1 genome sequence. More than 20,000 mutants were analyzed, 
which resulted in 12,219 independent insertions being mapped. In order to identify 
essential genes, transposon insertion sites were analyzed with respect to the 
protein-encoding genes in this organism. A data set consisting of the gene IDs, their lengths in 
DNA base-pairs, and the number of transposon insertions was obtained from 
experiments. The data set consists of 5570 genes with 881 different sizes ranging from 
72 to 16884 DNA base-pairs. The distribution of the gene sizes is extremely skewed 
to the right, with the majority of the genes being smaller than 2000 DNA base-pairs, as 
shown in Figure 10. 

[0145] A randomly selected subset of the data is shown in Table 4, where $\delta$ is the gene 
size and $x$ is the observed number of transposon insertions. Insertions into essential genes are 
not observable, since the insertion mutants cannot survive for characterization when the 
transposon is inserted into an essential gene. Therefore, a gene with zero observed 
transposon insertions can be either an essential gene or a nonessential gene with zero 
transposon insertions. Consequently, the count of transposon insertions $x$ is truncated, 
with the truncation region being the single element {0}. 

Table 4: A sample of the gene data set 

Gene id    $\delta$    $x$ 
298        1359        3 
4047       618         0 
1170       735         1 
4953       1044        1 
5526       213         0 
4624       1707        4 
5069       426         3 

[0146] Since the insertion into the chromosome of Pseudomonas aeruginosa is 
random (Reznikoff WS. 1993), and the probability of receiving an insertion for a given 
gene is proportional to its size measured in DNA base-pairs, the number of transposon 
insertions into a gene is distributed as truncated Poisson with parameter $\lambda\delta$, where $\delta$ is 
the size of the gene and $\lambda$ is an unknown parameter, which is independent of gene size. 
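
The zero-truncated Poisson count described above can be sketched numerically. The function below evaluates the probability of observing exactly x insertions, given that at least one was observed, for a Poisson mean equal to the per-base-pair rate times the gene size. The function name and the rate value are illustrative assumptions; the gene size of 735 base-pairs is taken from Table 4.

```python
import math


def truncated_poisson_pmf(x, rate_per_bp, length_bp):
    """P(X = x | X >= 1) for X ~ Poisson(rate_per_bp * length_bp)."""
    if x < 1:
        return 0.0  # zero counts lie in the truncation region {0}
    mu = rate_per_bp * length_bp
    # Work in log space: log(e^{-mu} * mu^x / x!)
    log_p = -mu + x * math.log(mu) - math.lgamma(x + 1)
    # -expm1(-mu) equals 1 - e^{-mu}, the probability of a nonzero count
    return math.exp(log_p) / -math.expm1(-mu)


# Hypothetical rate; gene size from Table 4 (gene 1170, 735 base-pairs).
ASSUMED_RATE = 0.002
total = sum(truncated_poisson_pmf(x, ASSUMED_RATE, 735) for x in range(1, 200))
```

Summing the probabilities over x = 1, 2, ... gives a total of 1 up to floating-point error, which is a quick check that the normalizing factor for the truncation is correct.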

B. A Bayesian Model 

[0147] Let R be a measurable subset of the probability space $\Omega$ such that a random 
variable X is observable only if $X \in \Omega \setminus R$. In this example, no observations can be 
obtained from essential genes, and only nonzero observations can be obtained from 
nonessential genes, so the set R consists of the single element {0}. 
1. 

a. One Gene Size 

[0148] Assume all genes in a genome have the same size, $\delta$, and let N be the number of 
nonessential genes in this genome. Then the observations $X_1, X_2, \ldots, X_N$ from the N 
nonessential genes are i.i.d. Poisson($\lambda\delta$), of which all observations of value zero are 
truncated. The product $\lambda\delta$ indicates that the probability of a gene receiving an insertion 
is proportional to its size. 

[0149] Let $\{x_1^*, x_2^*, \ldots, x_n^*\} \subset \{X_1, X_2, \ldots, X_N\}$ denote the subset of all nonzero 
observations. Then this subset composes a random sample of size n from a truncated 
Poisson distribution whose distribution function can be written as 

$$f(x; \lambda, \delta) = \frac{e^{-\lambda\delta} (\lambda\delta)^{x}}{x!\,(1 - e^{-\lambda\delta})}, \quad x = 1, 2, \ldots \tag{3.1}$$

[0150] Let $q = 1 - e^{-\lambda\delta}$ denote the probability that an observation from Poisson($\lambda\delta$) is 
not truncated, and let $p = 1 - q = e^{-\lambda\delta}$. Then, conditional on the parameters n and N, the 
likelihood function of the joint distribution of $\{x_1^*, x_2^*, \ldots, x_n^*\}$ can be written as 

$$L(\lambda \mid n, N) = (\lambda\delta)^{s} \left(\frac{p}{q}\right)^{n} \left(\prod_{i=1}^{n} x_i^{*}!\right)^{-1}. \tag{3.2}$$

[0151] Let 

$$s = x_1^* + x_2^* + \cdots + x_n^* \tag{3.3}$$

denote the sum of all nonzero observations, and notice that n follows a binomial 
distribution B(N, q). The likelihood function of the joint distribution of $\{n, x_1^*, x_2^*, \ldots, x_n^*\}$, 
conditional on the parameter N, can be obtained as 



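$$L(\lambda \mid N) = \binom{N}{n} (\lambda\delta)^{s} e^{-N\lambda\delta} \left(\prod_{i=1}^{n} x_i^{*}!\right)^{-1}. \tag{3.4}$$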



[0152] The Bayesian model consists of the conditional model (3.4) and a prior 
distribution of the parameter N. Assuming N, the number of nonessential genes, is 
binomial B(M, $\gamma$), where M is the total number of genes of size $\delta$, which is known, and 
$\gamma$ is the proportion of nonessential genes, which is unknown and is independent of gene 
size, we can write the likelihood function of the joint distribution of $\{n, N, x_1^*, x_2^*, \ldots, x_n^*\}$ 
as 

$$L(\gamma, \lambda, N) \propto \frac{\gamma^{N} (1-\gamma)^{M-N} \lambda^{s} e^{-\lambda\delta N}}{(M-N)!\,(N-n)!}. \tag{3.5}$$



[0153] This is the likelihood function of n nonzero observations from M genes of the 
same size $\delta$, of which N genes are nonessential. It is easy to see that (3.5) is 
proportional to the likelihood function of the posterior distribution of N given 
observations n and s. 



2. 



b. Multiple Gene Sizes 

[0154] For a given genome consisting of genes of different sizes, let 
$\delta = (\delta_1, \delta_2, \ldots, \delta_g)^T$ denote the vector of g different gene sizes, and let $M = (M_1, M_2, \ldots, M_g)^T$ be 
the vector of known numbers of total genes, $N = (N_1, N_2, \ldots, N_g)^T$ the unknown numbers 
of nonessential genes, $n = (n_1, n_2, \ldots, n_g)^T$ the numbers of nonzero observations from the 
nonessential genes, and $s = (s_1, s_2, \ldots, s_g)^T$ the sums of nonzero observations, as defined 
in (3.3). 

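[0155] The likelihood function of the joint distribution of $\{n, N, s\}$ can be written as 

$$L(\gamma, \lambda, N) \propto \gamma^{\|N\|} (1-\gamma)^{\|M\| - \|N\|} \lambda^{\|s\|} e^{-\lambda(\delta^T N)} \prod_{i=1}^{g} \left[(M_i - N_i)!\,(N_i - n_i)!\right]^{-1} \tag{3.6}$$

where $\|\cdot\|$ is the $L_1$ norm of a vector, and $\delta^T N = \sum_{i=1}^{g} \delta_i N_i$. 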

[0156] Let $\ell = \ln(L)$. Then, up to an additive constant, the log likelihood function of 
the joint distribution of $\{n, N, s\}$ can be written as 

$$\ell(\gamma, \lambda, N) = \|N\| \ln(\gamma) + (\|M\| - \|N\|) \ln(1-\gamma) + \|s\| \ln(\lambda) - \lambda(\delta^T N) - \sum_{i=1}^{g} \left[\ln((M_i - N_i)!) + \ln((N_i - n_i)!)\right] \tag{3.7}$$

where $(\gamma, \lambda, N)$ are the parameters of interest. The vector N is defined on 
$\{n_i \le N_i \le M_i : i = 1, 2, \ldots, g\}$, and $\ell(\gamma, \lambda, N)$ is, up to an additive constant, the log of the 
likelihood function of the posterior distribution of N given n and s. 



[0157] When g is large, say, on the order of hundreds as in the situation we are 
dealing with in this example, obtaining the maximum likelihood (ML) estimate of 
$N = (N_1, N_2, \ldots, N_g)^T$ from (3.7) in such a high dimensional parameter space is a very 
difficult task both theoretically and computationally. In the next section, we will 
present a stepwise, maximum likelihood gain based method to obtain the ML 
estimate. 

C. ML Estimation of Parameters 

[0158] For any $N = (N_1, N_2, \ldots, N_g)^T$, it is easy to verify using (3.7) that the ML 
estimates of the parameters $\gamma$ and $\lambda$ are 

$$\hat{\gamma} = \|N\| / \|M\| \tag{4.1}$$

and 

$$\hat{\lambda} = \|s\| / (\delta^T N) \tag{4.2}$$

respectively. Substituting (4.1) and (4.2) for $\gamma$ and $\lambda$ in (3.7), we have, up to an additive constant, 

$$\ell^*(N) = \|N\| \ln(\|N\|) + (\|M\| - \|N\|) \ln(\|M\| - \|N\|) - \|s\| \ln(\delta^T N) - \sum_{i=1}^{g} \left[\ln((M_i - N_i)!) + \ln((N_i - n_i)!)\right]. \tag{4.3}$$
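
The closed-form estimates (4.1) and (4.2) can be checked numerically with a short sketch. The function names and the three-size data set below are hypothetical, invented only to illustrate the formulas; `partial_loglik` evaluates just the terms of (3.7) that depend on the proportion of nonessential genes and on the insertion rate.

```python
import math


def ml_gamma_lambda(N, M, delta, s):
    """Closed-form ML estimates of (4.1) and (4.2) for a fixed vector N."""
    gamma_hat = sum(N) / sum(M)  # ||N|| / ||M||
    lambda_hat = sum(s) / sum(d * k for d, k in zip(delta, N))  # ||s|| / delta^T N
    return gamma_hat, lambda_hat


def partial_loglik(gamma, lam, N, M, delta, s):
    """Terms of the log likelihood (3.7) that depend on gamma and lambda."""
    tot_n, tot_m, tot_s = sum(N), sum(M), sum(s)
    weighted = sum(d * k for d, k in zip(delta, N))
    return (tot_n * math.log(gamma)
            + (tot_m - tot_n) * math.log(1.0 - gamma)
            + tot_s * math.log(lam)
            - lam * weighted)


# Hypothetical data: three gene sizes (bp), gene counts, nonessential counts,
# and insertion sums per size group.
delta = [300, 900, 1500]
M = [100, 100, 50]
N = [80, 80, 40]
s = [48, 144, 120]

# Here gamma_hat = 200/250 and lambda_hat = 312/156000.
gamma_hat, lambda_hat = ml_gamma_lambda(N, M, delta, s)
best = partial_loglik(gamma_hat, lambda_hat, N, M, delta, s)
```

Evaluating `partial_loglik` at nearby parameter values gives smaller results than `best`, consistent with (4.1) and (4.2) being the maximizers for fixed N.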

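[0159] For $1 \le i \le g$, define 

$$\Delta_i \ell^*(N) = \ell^*(N + 1_i) - \ell^*(N) \tag{4.4}$$

for any $N \in \{n_i \le N_i < M_i,\ n_j \le N_j \le M_j : j \ne i\}$. In equation (4.4), $1_i = (0, \ldots, 0, 1, 0, \ldots, 0)^T$ with 1 
at the $i$th position. For notational purposes, let 

$$\eta(k) = k \ln(k) + (\|M\| - k) \ln(\|M\| - k) \tag{4.5}$$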



for $\|n\| \le k < \|M\|$. Then, (4.4) can be written as 



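$$\Delta_i \ell^*(N) = \eta(\|N\| + 1) - \eta(\|N\|) - \|s\| \ln\left(1 + \frac{\delta_i}{\delta^T N}\right) + \ln(M_i - N_i) - \ln(N_i - n_i + 1) \tag{4.6}$$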



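[0160] To obtain the ML estimate of N, define an operator, $\oplus$, between the 
observed vector n and any integer k with $0 \le k \le \|M\| - \|n\|$ as follows: 

$$\begin{aligned} n \oplus 1 &= \{n + 1_i : \Delta_i \ell^*(n) \ge \Delta_j \ell^*(n) \text{ for all } j \ne i\}, \text{ and} \\ n \oplus k &= (n \oplus (k-1)) \oplus 1 \quad \text{for } k \ge 2. \end{aligned} \tag{4.7}$$

We also define a likelihood-gain function G with G(0) = 0 and 

$$G(k) = \ell^*(n \oplus k) - \ell^*(n \oplus (k-1)) \tag{4.8}$$

for $1 \le k \le \|M\| - \|n\|$. 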

[0161] Using this likelihood-gain function, we can search for the ML estimate of N as 
follows: 

1. Start with the observation n as the initial estimate of N, and denote it as $N^0$. 

2. For each gene size $\delta_i$ with $n_i < M_i$, i = 1, 2, ..., g, calculate a likelihood difference 
$\Delta_i \ell^*(N^0) = \ell^*(N^0 + 1_i) - \ell^*(N^0)$ by setting $N_i = n_i + 1$ and $N_j = n_j$ for all $j \ne i$. 

3. Update the initial values by setting $N_i^0 = N_i^0 + 1$ such that 
$\Delta_i \ell^*(N^0) = \max\{\Delta_j \ell^*(N^0),\ j = 1, 2, \ldots, g\}$. This maximum likelihood difference is the 
likelihood gain defined in (4.8). 

4. Repeat the process until it converges. By convergence we mean that either the 
estimated number of nonessential genes equals the number of genes in each size 
group, or increasing the number of nonessential genes in any size group will 
result in a loss of likelihood. 
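
The four steps above can be sketched in code. This is a minimal illustration rather than the implementation actually used: it assumes the profile log likelihood has the form obtained by substituting the ML estimates of the two scalar parameters into the log likelihood (log-factorial terms included), it recomputes that function directly at every step instead of using an incremental formula, and the function names and the three-size data set are hypothetical.

```python
import math


def profile_loglik(N, n, M, delta, s_total):
    """Profile log likelihood of N, up to an additive constant."""
    tot_n, tot_m = sum(N), sum(M)
    weighted = sum(d * k for d, k in zip(delta, N))  # delta^T N

    def xlogx(x):
        return x * math.log(x) if x > 0 else 0.0

    value = xlogx(tot_n) + xlogx(tot_m - tot_n) - s_total * math.log(weighted)
    for m_i, big_n_i, n_i in zip(M, N, n):
        # ln((M_i - N_i)!) + ln((N_i - n_i)!) via the log-gamma function
        value -= math.lgamma(m_i - big_n_i + 1) + math.lgamma(big_n_i - n_i + 1)
    return value


def greedy_ml_search(n, M, delta, s_total):
    """Stepwise likelihood-gain search; returns (estimate of N, gains)."""
    N = list(n)  # step 1: start from the observed counts
    current = profile_loglik(N, n, M, delta, s_total)
    gains = []
    while True:
        candidates = []
        for i in range(len(N)):  # step 2: try one more gene in each size group
            if N[i] < M[i]:
                N[i] += 1
                gain = profile_loglik(N, n, M, delta, s_total) - current
                candidates.append((gain, i))
                N[i] -= 1
        if not candidates:
            break  # every size group is already full
        gain, i = max(candidates)  # step 3: take the largest likelihood gain
        if gain <= 0:
            break  # step 4: any further increase would lose likelihood
        N[i] += 1
        current += gain
        gains.append(gain)
    return N, gains


# Hypothetical data: gene sizes (bp), gene counts per size, counts of genes
# with at least one insertion, and the total number of insertions.
delta = [300, 900, 1500]
M = [100, 100, 50]
n = [36, 67, 38]
s_total = 312
N_hat, gains = greedy_ml_search(n, M, delta, s_total)
```

The returned list of gains corresponds to the values of the likelihood-gain function at successive iterations, and the search stops at the first iteration where no single increment improves the likelihood.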

[0162] This algorithm searches for the ML estimator in a high dimensional space (881 in 
our study) along a path such that at each iteration, it moves in a direction (that is, 
increases the number of nonessential genes in one size group by one) along which the 
likelihood gain is maximum among all possible directions. Because the searching 
algorithm prohibits reversal of previous moves at any later iteration, it moves towards 
the ML estimator along the shortest path with the steepest ascent (maximum 
likelihood gain) at each step. Table 5 and Figures 11 and 12 show the values of 
likelihood gains in each iteration. With very few exceptions, where monotonicity is 
violated only at the fourth or fifth decimal place and which can probably be attributed to 
rounding errors, the likelihood gain is a monotonically decreasing function. 

Table 5. A Sample of Likelihood Gains at Each Iteration 

Iteration    id     $\delta$    M     n     n(i)    G(i) 
1            28     210         13    2     3       2.67559 
2            60     306         14    3     4       2.41082 
             44     258         14    5     6       2.34388 
             63     315         15    5     6       2.29243 
... 
18           32     222         7     1     2       2.05160 
19           81     369         11    2     3       2.05166 
... 
774          122    492         12    8     11      0.00692 
775          266    924         16    14    15      0.00544 
776          85     381         14    3     11      0.00531 


The following three theorems show that the estimates obtained through the above 
algorithm are indeed the maximum likelihood estimates. 



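THEOREM 1: If 

$$\max_{1 \le i \le g} \left[\ln\left(\frac{(M_i - n_i)\,\|n\|}{\|M\| - \|n\|}\right) - \frac{\|s\|\,\delta_i}{\delta^T n}\right] > 0, \tag{4.9}$$

then G(1) > 0. 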



Proof: If $G(1) \le 0$, then by (4.5), $\Delta_j \mathfrak{I}^*(n) \le 0$ for all $1 \le j \le g$, which leads to

[chain of inequalities illegible in the source]

Using the facts that $(1 + 1/x)^x < e$, $(1 + 1/x)^{x+1} > e$, and $(1 - 1/x)^{-x} > e$ for any $x > 0$, we obtain

[resulting inequality missing in the source]

This is contradictory to condition (4.9).

For $g = 1$, (4.9) becomes $\ln(n) > (x_1 + x_2 + \cdots + x_n)/n$. Hence, when the mean of the
observed transposon insertions is less than the log of the number of nonzero
observations, the vector n cannot be the ML estimator of N and there must be
truncated observations from nonessential genes.
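The $g = 1$ form of the condition is easy to check numerically. The sketch below is illustrative only; the function name `truncation_indicated` and the example counts are hypothetical and are not data from the study.

```python
import math
from typing import Sequence


def truncation_indicated(nonzero_counts: Sequence[int]) -> bool:
    """Return True when ln(n) > mean(x), the g = 1 form of condition (4.9).

    `nonzero_counts` holds the insertion counts of the n genes that received
    at least one transposon insertion.
    """
    n = len(nonzero_counts)
    if n == 0:
        raise ValueError("at least one nonzero observation is required")
    return math.log(n) > sum(nonzero_counts) / n


# Hypothetical counts: 8 genes, mean 1.5 insertions, ln(8) is about 2.08.
print(truncation_indicated([1, 2, 1, 3, 1, 2, 1, 1]))
# Hypothetical counts: 3 genes, mean 6 insertions, ln(3) is about 1.10.
print(truncation_indicated([5, 6, 7]))
```

When the function returns True, the observed vector n cannot be the ML estimator of N under the stated condition, so some nonessential genes must have gone unobserved.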
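THEOREM 2:

$$\Delta_i \mathfrak{I}^*(N) > \Delta_i \mathfrak{I}^*(N - 1_j) \quad \text{for all } i \ne j \qquad (4.10)$$

Proof: By definition in (4.5),

$$\frac{d\,[\eta(x+1) - \eta(x)]}{dx} = \ln\left(\frac{x+1}{x} \cdot \frac{\|M\| - x}{\|M\| - x - 1}\right) > 0$$

for any $0 < x < \|M\|$. Hence $\eta(\|N\|+1) - \eta(\|N\|)$ is an increasing function of $\|N\|$. Using
this result, we have

$$\begin{aligned}
\Delta_i \mathfrak{I}^*(N) - \Delta_i \mathfrak{I}^*(N - 1_j)
&= \big(\eta(\|N\|+1) - \eta(\|N\|)\big) - \big(\eta(\|N\|) - \eta(\|N\|-1)\big) \\
&\quad - \|S\| \ln\big(1 + \delta_i/(\delta \cdot N)\big) + \|S\| \ln\big(1 + \delta_i/(\delta \cdot (N - 1_j))\big) \\
&> \|S\| \big[\ln\big(1 + \delta_i/(\delta \cdot N - \delta_j)\big) - \ln\big(1 + \delta_i/(\delta \cdot N)\big)\big] > 0
\end{aligned}$$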

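THEOREM 3: Under (4.9), for any $1 \le j \le g$ and $1 \le k \le K^*$,
with $K^* = \max\{k' \ge 0 : G(k) \ge 0 \text{ for all } 0 \le k \le k'\}$,
if $n \oplus k - 1_j \in \{N : n_j \le N_j \le M_j\}$, then

$$\mathfrak{I}^*(n \oplus k) > \mathfrak{I}^*(n \oplus k - 1_j) \qquad (4.11)$$

Proof: This is obviously true when $k = 1$. Assume (4.11) is true for integers $1, \ldots, k$.
For integer $k + 1$, we have

$$\begin{aligned}
\mathfrak{I}^*(n \oplus (k+1) - 1_j) - \mathfrak{I}^*(n \oplus k)
&= [\mathfrak{I}^*(n \oplus (k+1) - 1_j) - \mathfrak{I}^*(n \oplus k - 1_j)] + [\mathfrak{I}^*(n \oplus k - 1_j) - \mathfrak{I}^*(n \oplus k)] \\
&< [\mathfrak{I}^*(n \oplus (k+1) - 1_j) - \mathfrak{I}^*(n \oplus k - 1_j)]
\end{aligned}$$

By Theorem 2,

$$\mathfrak{I}^*(n \oplus (k+1) - 1_j) - \mathfrak{I}^*(n \oplus k - 1_j) < \mathfrak{I}^*(n \oplus (k+1)) - \mathfrak{I}^*(n \oplus k)$$

Therefore

$$\mathfrak{I}^*(n \oplus (k+1)) > \mathfrak{I}^*(n \oplus (k+1) - 1_j)$$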
Combining theorems 1-3, we obtain 

THEOREM 4: If the likelihood function defined in (3.7) has a unique solution, the
ML estimator of N is:



(4.12)

[0163] Theorem 3 guarantees that the trajectory of the searching algorithm follows
the shortest path in the sense that a reversal of a previous move (that is, removal of a
previously added nonessential gene of any gene size) at any later state will result in a
loss of likelihood. This property is illustrated in Figure 4, which shows the trajectory of
the searching algorithm projected onto a subspace spanned by two different gene sizes.
For illustration purposes, genes are grouped into 143 groups by grouping genes with
similar sizes together to increase the length of the trajectory. As indicated in the plot, at
any state, moving backwards in any direction results in a loss of likelihood. Figure 13
shows more trajectories projected onto different subspaces.

[0164] Now we need to demonstrate that the likelihood function (3.7), which is
defined in a high-dimensional discrete space, has a unique solution. This can be
established if the same estimates are obtained from different initial values. Since the
initial values can be any value between the observation n and the total number of genes
M, we need to extend the searching algorithm (4.7) as follows:

For any initial value $\{N^0 : n_i \le N_i^0 \le M_i,\ \text{for } i = 1, 2, \ldots, g\}$ and any integer $k$,

[definition of $N^0 \oplus k$ illegible in the source]    (4.13)

The likelihood gain function is extended similarly as $G(0) = 0$ and

$$G(k) = \mathfrak{I}^*(N^0 \oplus k) - \mathfrak{I}^*(N^0 \oplus (k-1)) \qquad (4.14)$$

for [range of $k$ missing in the source].

[0165] Algorithm (4.13) preserves all the properties of algorithm (4.7) and searches for
the ML estimator in the same way as algorithm (4.7), with two exceptions. Algorithm
(4.7) uses n as the initial value of N and, at each iteration, increases by one the number of
nonessential genes in the gene group of size $\delta_i$ that gives the maximum
likelihood gain. Algorithm (4.13) instead uses $N^0$ as the initial value of N, which can be greater than
the ML estimator. Therefore, at each iteration, the number of nonessential genes in a
group with size $\delta_i$ can be either increased or decreased by one such that the likelihood
gain is maximal.

[0166] Randomly selected initial values $N^0$ were used for data with grouped gene
sizes and data with exact gene sizes. The estimates of all parameters are exactly the
same and the final likelihood for all initial values $N^0$ is exactly the same, as indicated
in Figure 14, which plots twenty-seven different initial values of $N^0$. The line on the far
left represents the likelihood when $N^0 = n$, and the lines in the middle are randomly
selected. Figure 15 is the trajectory projected onto a subspace spanned by two gene
sizes. Each circle represents the projection of a different initial value $N^0$. Regardless of
the initial values, the trajectories all converge to the ML estimator.
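The multi-start check described above can be mimicked on a toy problem. The sketch below is an assumption-laden illustration: `bidirectional_search`, the toy log-likelihood, and the bounds are hypothetical, and the code only shows the idea of allowing a step of +1 or -1 in any size group and comparing the end points reached from random initial values.

```python
import random
from typing import Callable, List, Sequence


def bidirectional_search(
    N0: Sequence[int],
    n: Sequence[int],
    M: Sequence[int],
    loglik: Callable[[Sequence[int]], float],
) -> List[int]:
    """From N0, repeatedly apply the single +1/-1 move with the largest gain.

    Moves must keep n_j <= N_j <= M_j. The search stops when no admissible
    move increases the log-likelihood.
    """
    N = list(N0)
    while True:
        base = loglik(N)
        best_gain, best_move = 0.0, None
        for j in range(len(N)):
            for step in (1, -1):
                candidate = N[j] + step
                if not n[j] <= candidate <= M[j]:
                    continue
                N[j] = candidate
                gain = loglik(N) - base
                N[j] -= step
                if gain > best_gain:
                    best_gain, best_move = gain, (j, step)
        if best_move is None:
            return N
        j, step = best_move
        N[j] += step


# Hypothetical single-peaked stand-in for the log-likelihood (peak at 5, 4, 7).
def toy_loglik(N: Sequence[int]) -> float:
    return -float(sum((a - b) ** 2 for a, b in zip(N, (5, 4, 7))))


n_obs, M_tot = [2, 3, 1], [13, 14, 7]
rng = random.Random(0)
starts = [[rng.randint(lo, hi) for lo, hi in zip(n_obs, M_tot)] for _ in range(27)]
end_points = {tuple(bidirectional_search(s, n_obs, M_tot, toy_loglik)) for s in starts}
print(end_points)  # a single element means every start reached the same estimate
```

Agreement of the end points is evidence of a unique maximum on the lattice, not a proof; that is the same kind of empirical argument the text makes with Figures 14 and 15.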

D. ANALYSIS OF PSEUDOMONAS AERUGINOSA DATA 

1. Multivariate Model with Exact Gene Sizes

[0167] The data considered here consist of observations from 5570 genes of 881
different sizes, ranging from 72 to 16884 DNA base pairs. The distribution of gene size is
severely skewed to the right, as indicated in Figure 10. For many sizes, especially for
sizes smaller than 200 or greater than 2000 DNA base pairs, there is only one gene of a
given size, and the observations of transposon insertions for small genes are usually
truncated. Since all genes are modeled simultaneously in a single model with a prior γ
enforcing that the essentialness of a gene is independent of its size, the sparseness of the
data does not impose limitations on the computation. However, as discussed in the next
section, the prior may play a dominating role for small genes where data are sparse.
The estimates of γ and λ, together with the 95 percent BCa confidence intervals, are
presented in Table 6 and the estimate of N is presented in Figure 16.

Table 6. Parameter Estimation of γ and λ

     Estimate       Bias            SE             BCa Confidence Intervals
γ    0.8434         3.942×10^-?     9.893×10^-?    (0.818, 0.859)
λ    2.547×10^-3    -1.027×10^-?    4.392×10^-?    (0.00247, 0.00264)

Here the bias and standard error are estimated with the bootstrap.

2. Multivariate Model With Grouped Gene Sizes

[0168] The prior γ plays an important role in enforcing the fact that the essentialness
of a gene is independent of its size. It also makes it possible to estimate the number of
essential genes where data are very sparse. However, for small genes where data are
extremely sparse, the prior γ becomes the dominating source of information. In order
to moderate the dominance of the prior on small genes with sparse observations, we
grouped the genes into 143 groups according to their sizes, using the median size of
each group as the gene size. Table 7 is a sample of estimated N based on grouped and
exact gene sizes. In the table, m is the number of unique sizes in each group; N1 is
estimated using grouped data and N2 is estimated using ungrouped data.

Table 7: Estimated N with Grouped and Exact Gene Sizes

Gene size     m    M     x    n    N1    N2
[72, 120]     6    7     3    2    6     7
(120, 150]    4    7     3    2    6     7
(150, 160]    3    7     2    2    6     7
(160, 170]    2    8     0    0    7     7
(170, 180]    3    9     1    1    8     8
(180, 190]    3    9     1    1    8     8
(190, 200]    3    12    4    4    11    11
(200, 210]    4    27    7    7    23    24
(210, 220]    3    19    7    5    16    17


[0169] We see that here $N_1 \le N_2$. However, this is true only for data in the above table,
where the ungrouped data are extremely sparse and most of the data are truncated. The
estimated proportion of non-essential genes, γ, is actually larger for grouped data, as
presented in Table 8. Grouping genes with similar sizes reduces the sparseness of the
data and, consequently, the dominance of the prior. Another obvious advantage of
grouping is dimension reduction of the parameter space and, therefore, a drastic
reduction of computation time. Of course, such grouping introduces another source of
variation, and the algorithm might not be robust to different groupings. In our study,
however, different groupings resulted in only slight differences in the estimates.

3. Conditional Maximum Likelihood Estimates

For a given gene size $\delta_j$, the likelihood function (3.4) can be written differently as

[equation illegible in the source]    (5.1)

[0170] Here $f(\cdot, \cdot)$ is defined in (3.1), and $x_j = (x_{j1}, \ldots, x_{jn_j})$ are the $n_j$ nonzero
observations from the $N_j$ genes of size $\delta_j$. Assuming there are g different gene sizes, the
likelihood function can be written as

[equation missing in the source]

[0171] Assuming the number of observations $n_j$ for each gene size $\delta_j$ to be fixed, we
can obtain the conditional maximum likelihood estimate of λ by maximizing this likelihood as

[equation illegible in the source]    (5.3)

where [definition illegible in the source].

Equation (5.3) reduces to equation (4.2) if we estimate $N_j$ by [expression missing in the source].

The proportion of truncated nonessential genes can be calculated as

$$p = P(x = 0 \mid \text{nonessential}) = \int_{\Omega} e^{-\lambda\delta}\, dF(\delta). \qquad (5.4)$$

[0172] Here Ω is the set of nonessential genes, which can be approximated by the set
of all untruncated genes. Therefore,

[equation illegible in the source]    (5.5)

Estimates from the three approaches are very similar, as shown in Table 8. If the
primary interest is to estimate λ and γ, the conditional MLE approach has the advantage
of simplicity. However, in estimating λ, this approach omits the information in M, and γ
is estimated separately after λ is estimated. Another obvious limitation of this approach
is that it can only estimate ||N||, the total number of nonessential genes, from ||n||. The
estimation of N from n is not reasonable because, even though γ is independent of gene
size, we cannot assume the proportion of non-essential genes of different sizes to be
the same, as shown in Figure 17.
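As a rough numerical illustration of (5.4), the integral over the size distribution can be replaced by an average over a list of gene sizes. The sketch assumes that a gene of size δ receives no insertion with probability exp(-λδ), which is how (5.4) is read here; the function name `truncation_probability` and the list of sizes are hypothetical, while the value of λ is the Table 6 estimate.

```python
import math
from typing import Sequence


def truncation_probability(sizes_bp: Sequence[float], lam: float) -> float:
    """Average of exp(-lam * size) over an empirical gene size distribution.

    This approximates p = P(x = 0 | nonessential) when the distribution F
    in (5.4) is replaced by the observed sizes.
    """
    if not sizes_bp:
        raise ValueError("at least one gene size is required")
    return sum(math.exp(-lam * s) for s in sizes_bp) / len(sizes_bp)


lam_hat = 2.547e-3                         # per-base-pair rate from Table 6
example_sizes = [72, 210, 500, 1000, 2000]  # hypothetical sizes in base pairs
for size in example_sizes:
    print(size, round(truncation_probability([size], lam_hat), 3))
print("average p:", round(truncation_probability(example_sizes, lam_hat), 3))
```

The per-size values fall quickly as the gene gets longer, which is consistent with the remark that observations for small genes are usually truncated.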



Table 8. Estimates of γ and λ with the Three Approaches

Model                                        Parameter   Estimate      Bias            SE             95% BCa Confidence Intervals
Multivariate Model with Exact Gene Sizes     γ           0.843         3.942×10^-?     9.893×10^-?    (0.818, 0.859)
                                             λ           2.547×10^-3   -1.024×10^-?    4.320×10^-?    (2.473, 2.642)×10^-3
Multivariate Model with Grouped Gene Sizes   γ           0.853         7.221×10^-?     8.051×10^-?    (0.835, 0.867)
                                             λ           2.524×10^-3   2.803×10^-?     4.063×10^-?    (2.451, 2.610)×10^-3
Conditional Maximum Likelihood Estimates     γ           0.828         -7.621×10^-?    7.273×10^-?    (0.815, 0.843)
                                             λ           2.539×10^-3   9.713×10^-?     4.058×10^-?    (2.455, 2.618)×10^-3



E. DISCUSSION OF ONE-DIMENSIONAL CASE

[0173] When the model does not depend on gene size, which can happen, for
example, when we study a subset of genes with a fixed size, or in other settings where
the distribution is identical, model (2.6) reduces to (2.5). Blumenthal, Dahiya, and
Gross (1978) studied estimation of the complete sample size from an incomplete Poisson
sample using conditional, unconditional, and modified maximum likelihood functions.
The modified likelihood estimation weights the likelihood function and maximizes it.
This approach is similar to providing priors to λ and N. Table 9 presents four types of
estimates of N using data randomly selected from the 143 gene groups. Here M
and n are the number of genes and the number of genes with at least one observed transposon
insertion. Nm-b is a subset of N1 in Table 7, which is estimated using model (2.6) with
grouped data; Nb is estimated with model (2.5); Nc and Nu are the conditional and
unconditional estimates of N as described in Blumenthal, Dahiya, and Gross (1978).



Table 9: Comparison of Estimations with Different Methods

Gene size       M     n     Nm-b   Nb    Nu    Nc
[72, 120]       7     2     6      2     3     2
(400, 410]      44    31    40     37    36    36
(430, 440]      46    22    38     25    25    26
(470, 480]      80    42    66     57    56    57
(500, 510]      54    30    45     39    39    39
(610, 620]      47    29    39     33    33    34
(640, 650]      54    35    45     44    43    43
(710, 720]      50    35    42     41    41    41
(750, 760]      56    37    46     39    46    40
(770, 780]      61    43    51     53    52    52
(910, 920]      60    47    52     53    53    52
(980, 990]      57    45    49     52    51    51
(1050, 1100]    137   107   115    117   115   117
(1200, 1250]    129   100   106    106   106   106
(1400, 1450]    121   110   111    112   112   112
(2100, 2150]    23    20    20     20    20    20



[0174] We see that the estimates from the three univariate models are very similar.
For fairly large genes, estimates from the multivariate model are similar to those of
the univariate models. However, for small genes with a high truncation rate, estimates
from the multivariate model are larger than estimates from the univariate models. In
the univariate models, only the information related to a particular gene size is used and
the estimates are obtained separately for each gene size. This approach tends to
underestimate N for small genes with sparse observations. The multivariate model uses
a prior to enforce the fact that the essentialness of a gene is independent of its size and
maximizes the likelihood jointly for all genes. Therefore, it alleviates the
underestimation of N for small genes with a high truncation rate.
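For orientation, a univariate conditional estimate of this kind can be sketched with the textbook zero-truncated Poisson estimator: the Poisson mean μ solves mean(x) = μ / (1 - exp(-μ)) over the nonzero counts, and the complete sample size is then estimated as n / (1 - exp(-μ)). Whether this matches the Nc column of Table 9 exactly is an assumption; the function name and the example counts are hypothetical.

```python
import math
from typing import Sequence, Tuple


def conditional_estimate(nonzero_counts: Sequence[int]) -> Tuple[float, float]:
    """Zero-truncated Poisson estimate of (mu, N) for one gene size.

    mu is found by bisection on g(mu) = mu / (1 - exp(-mu)), which increases
    from 1 (as mu -> 0) and exceeds mean(x) at mu = mean(x).
    """
    n = len(nonzero_counts)
    if n == 0:
        raise ValueError("at least one nonzero observation is required")
    mean = sum(nonzero_counts) / n
    if mean <= 1.0:
        raise ValueError("the mean of the nonzero counts must exceed 1")
    lo, hi = 0.0, mean
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        # -expm1(-mid) equals 1 - exp(-mid) but is accurate for small mid.
        if mid / -math.expm1(-mid) < mean:
            lo = mid
        else:
            hi = mid
    mu = 0.5 * (lo + hi)
    return mu, n / -math.expm1(-mu)


# Hypothetical insertion counts for 8 genes of one size.
mu_hat, N_hat = conditional_estimate([1, 1, 2, 3, 1, 2, 4, 1])
print(round(mu_hat, 3), round(N_hat, 2))
```

Because each size is fitted on its own, a size group with only a handful of nonzero counts gives a very noisy estimate, which is the weakness of the univariate models that the paragraph above points out.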

Example 2: lpxC

[0175] Lipid A constitutes the outer layer of the outer membranes of gram-negative
bacteria and is essential for bacterial growth. This makes all the enzymes involved in
the biosynthesis of this molecule essential for bacterial growth, and therefore ideal
targets for drug design. A series of synthetic molecules was previously identified that
inhibited the first committed step in lipid A biosynthesis. Onishi H. R., B. A. Pelak, L.
S. Gerckens, L. L. Silver, F. M. Kahan, M-H Chen, A. A. Patchett, S. M. Galloway, S.
A. Hyland, M. S. Anderson, and C. R. H. Raetz. 1996. Science. 274: 980-982. This
step is catalyzed by a unique deacetylase (UDP-3-O-[R-3-hydroxymyristoyl]-GlcNAc
deacetylase), LpxC.



[0176] UDP-3-O-[R-3-hydroxymyristoyl]-GlcNAc deacetylase (LpxC) is a
deacetylase that catalyzes the first committed step of lipopolysaccharide (LPS)
biosynthesis in gram-negative bacteria. This is the second step, following the first
acylation of N-acetylglucosamine (GlcNAc). This enzyme functions to deacetylate
UDP-3-O-[R-3-hydroxymyristoyl]-GlcNAc. This step was shown to be essential for
growth in E. coli, wherein a point mutant (EnvA1) expresses an LpxC protein that has
reduced activity. Beall B. and J. Lutkenhaus. 1987. Sequence analysis, transcriptional
organization, and insertional mutagenesis of the envA gene of Escherichia coli. J.
Bacteriol. 169: 5408-5415. A 30% reduction in the amount of LPS on the cell wall of
such mutants results in hypersensitivity to antibiotics. Attempts to create null mutants
in lpxC were unsuccessful in a number of pathogenic bacteria, indicating that inhibitors
of LpxC would be effective antibiotics for a number of gram-negative organisms.

[0177] Previously identified inhibitors are chiral hydroxamic acids that have unique
hydrophobic aromatic moieties and were suspected to bind a metal in the active site of
the deacetylase. The most potent inhibitor, L-161,240, displayed a minimal inhibitory
concentration of about 1 microgram per milliliter against E. coli, caused three logs of
bacterial killing in 4 hours, and cured mice infected with a lethal intraperitoneal dose of
E. coli. Considering the very high degree of homology between the E. coli and P.
aeruginosa enzymes, it was initially presumed that an inhibitor of the E. coli enzyme
might also inhibit the P. aeruginosa enzyme. However, this molecule inhibited LpxC
from P. aeruginosa only at very high concentrations, and even then it did so poorly. It
had no effect on bacterial growth in this organism. Thus, there was some question as to
whether the lpxC homologue had the same function in P. aeruginosa, and whether it
was essential to P. aeruginosa given its decreased sensitivity to the L-161,240 inhibitor.

[0178] Nevertheless, P. aeruginosa lpxC was one nucleic acid identified as being
unable to accommodate a transposon insertion in the library depicted in Table 1
(PA4406). To test the essentiality of P. aeruginosa lpxC, we first tested the sensitivity
of P. aeruginosa transformants expressing E. coli LpxC following a "promoter swap"
integration. Using this technique, we completely shut off expression of the native P.
aeruginosa lpxC, while expressing only the E. coli enzyme encoded on a plasmid. This
strategy resulted in a P. aeruginosa mutant that was more sensitive to L-161,240. This
suggested that the E. coli lpxC gene was substituting for the function of the P.
aeruginosa gene and, moreover, that there were no duplicate functional homologues in
P. aeruginosa that were active in the absence of lpxC.

[0179] Materials. Pseudomonas aeruginosa PAO1 was grown at 37°C in Luria-Bertani
(LB) broth (Difco) or plated on sheep blood agar (Remel). Tetracycline at 100
µg/ml in LB media was used to maintain the selection of the integrated plasmid
pBEM10 in PAO1. LB broth or agar with 10 µg/ml of tetracycline was used for
growing E. coli DH5α (Gibco BRL) and E. coli S-17 transformants. Plasmids pSP72
and pBADHisB were from Promega and Invitrogen, respectively. EDTA, bis-tris buffer,
sucrose, arabinose, and DMSO were purchased from Sigma as Ultrapure agents. Yeast
extract and Tryptone were obtained from Difco. Restriction enzymes and T4 DNA
ligase, and their reaction buffers, were from New England Biolabs. Polymyxin B
nonapeptide was from Sigma. The antibiotics tetracycline, ampicillin, carbenicillin,
gentamicin, and kanamycin were all purchased from Sigma. DNA and deduced amino
acid information were analyzed using a family of programs included in the Dnastar
package. BLASTP was used to search for amino acid similarities among a host of
protein databases available online through the National Library of Medicine (USA).
Altschul, S. F., W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. 1990. Basic local
alignment search tool. J. Mol. Biol. 215: 403-410.

[0180] DNA manipulations. Standard recombinant DNA procedures were used.
Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecular Cloning: A Laboratory
Manual, 2nd Edition. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory.
Primers were designed to the N- and C-terminal regions of the E. coli or P. aeruginosa
lpxC gene that encompassed only the coding region and included NdeI and EcoRI
restriction sites for subsequent cloning. For the E. coli gene the primers were
(5'-GGGAATTCCATATGATCAAACAAAGGACACTTAAACGT-3' and
5'-CCGGAATTCTTATGCCAGTACAGCTGAAGGCGCT-3') and for the P. aeruginosa
gene they were
(5'-GGGAATTCCATATGATGATCAAACAACGCACCTTGAAGAACAT-3' and
5'-CCGGAATTCCTACACTGCCGCCGCCGGGCGCATATAG-3'). These primers were
used in a polymerase chain reaction (PCR) containing either P. aeruginosa genomic
DNA (10-50 µg) or plasmid pKD6 containing the E. coli lpxC gene (1.0 µg) as
template (Sorensen, P. G., J. Lutkenhaus, K. Young, S. S. Eveland, M. S. Anderson,
and C. R. H. Raetz. 1996. Regulation of UDP-3-O-[R-3-hydroxymyristoyl]-N-acetylglucosamine
deacetylase in Escherichia coli. The second enzymatic step of lipid
A biosynthesis. J. Biol. Chem. 271 (42): 25898-25905). The lpxC genes were amplified
using Pwo DNA polymerase (Roche) in a 100 µl reaction mixture containing a 200 µM
concentration of each dNTP and a 0.5 µM concentration of each primer for 30 cycles
(94°C denaturation, 55°C annealing, and 72°C polymerization), according to the
manufacturer's instructions. The PCR products were purified with the Qiaquick PCR
Purification Kit from Qiagen (according to the manufacturer's instructions) and
digested with NdeI and EcoRI restriction enzymes at sites introduced by the primer
sequences. Bands of the correct sizes predicted for the lpxC genes were separated by
gel electrophoresis, and the excised DNA was purified using the Qiaquick Gel Extraction
Kit from Qiagen (according to the manufacturer's instructions). The purified DNA was
ligated into the T7 expression vector (Studier, F. W., A. H. Rosenberg, J. J. Dunn, and J.
W. Dubendorff. 1990. Use of T7 RNA polymerase to direct expression of cloned genes.
Methods Enzymol. 185: 60-89) pET21b (Novagen) that had been cut in the multiple
cloning site with the same enzymes, transformed into DH5α, and plated on LB agar
containing ampicillin (250 µg/ml). The resulting clones had their DNA sequenced to
confirm the fidelity of the PCR reactions before it was transferred into the
expression strain. Subcloning of these fragments into other vectors was carried out as
needed for expression in various backgrounds. These included pEX18T (cbR) for
allelic exchange mutagenesis in P. aeruginosa (Schweizer, H. P. and T. T. Hoang. 1995.
An improved system for gene replacement and xylE fusion analysis in Pseudomonas
aeruginosa. Gene 158 (1): 15-22), pDN19 (tetR) for low copy number
complementation of E. coli JBK-1 (Nunn, D., S. Bergman, and S. Lory. 1990. Products
of three accessory genes, pilB, pilC, and pilD, are required for biogenesis of
Pseudomonas aeruginosa pili. J. Bacteriol. 172 (6): 2911-2919), and pUCP30T (gmR)
for P. aeruginosa 'promoter swap' mutant complementation (Schweizer, H. P., T. R.
Klassen, and T. Hoang. 1996. Improved methods for gene analysis and expression in
Pseudomonas. In: Nakazawa, T., K. Furukawa, D. Haas, S. Silver (Eds.), Molecular
Biology of Pseudomonas. American Society for Microbiology, Washington, DC. pp.
229-237).
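The restriction sites carried by the four lpxC primers quoted above can be located with a simple string search. The sketch below uses the standard recognition sequences for NdeI (CATATG) and EcoRI (GAATTC); the helper name `find_sites` and the dictionary labels are introduced here for illustration only. Both sites are palindromic, so searching one strand is sufficient.

```python
from typing import Dict

# Recognition sequences (5' to 3'); both are palindromic.
SITES: Dict[str, str] = {"NdeI": "CATATG", "EcoRI": "GAATTC"}

# Primer sequences as quoted in the text.
PRIMERS: Dict[str, str] = {
    "E. coli forward": "GGGAATTCCATATGATCAAACAAAGGACACTTAAACGT",
    "E. coli reverse": "CCGGAATTCTTATGCCAGTACAGCTGAAGGCGCT",
    "P. aeruginosa forward": "GGGAATTCCATATGATGATCAAACAACGCACCTTGAAGAACAT",
    "P. aeruginosa reverse": "CCGGAATTCCTACACTGCCGCCGCCGGGCGCATATAG",
}


def find_sites(primer: str) -> Dict[str, int]:
    """Map each enzyme whose site occurs in the primer to its 0-based position."""
    primer = primer.upper()
    return {name: primer.find(seq) for name, seq in SITES.items() if seq in primer}


for label, sequence in PRIMERS.items():
    print(label, find_sites(sequence))
```

In these sequences the forward primers carry both sites, with the NdeI site overlapping the ATG start codon, while the reverse primers carry only the EcoRI site.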
[0181] Construction of pBEM10 and 'promoter swap' mutagenesis. Plasmid
pPW101 was made by ligating oriT, the region that encodes conjugative plasmid
transfer, into pSP72 (Promega). oriT had been amplified from plasmid pEX100T
(Schweizer and Hoang, 1995, supra) with the introduction of NdeI and AatII
restriction sites. To create the lpxC 'promoter swap' vector, pBEM10, the following
DNA pieces were amplified and sequentially ligated into pPW101. These
included the tetracycline resistance marker (tetR) from plasmid pUCP26 (Olsen, R. H.,
G. DeBusscher, and R. R. McCombie. 1982. Development of broad-host-range vectors
and gene banks: self-cloning of the Pseudomonas aeruginosa PAO chromosome. J.
Bacteriol. 150: 60-69), the araBAD promoter from the plasmid pBAD HisB
(Invitrogen) with an altered ribosome binding site (rbs) (Guzman, L. M., D. Belin,
M. J. Carson, and J. Beckwith. 1995. Tight regulation, modulation, and high-level
expression by vectors containing the arabinose pBAD promoter. J. Bacteriol. 177
(14): 4121-4130), the araC gene, also from pBAD HisB (Lee, N. 1980. Molecular
aspects of ara regulation. In The Operon. J. H. Miller and W. S. Reznikoff, eds. Cold
Spring Harbor, NY: Cold Spring Harbor Laboratory, pp. 389-410; and Schleif, R. S.
1992. DNA looping. Ann. Rev. Biochem. 61: 199-223), and the first 340 base pairs of
the P. aeruginosa lpxC gene. The tetR marker was amplified using a forward primer
that introduced a BglII site (5'-AGATCTCAAGGGTTGGTTTGCGCA-3') and a
reverse primer that introduced an EcoRI site
(5'-GAATTCTAATTCTCATGTTTGACA-3'). The araBAD promoter and araC gene
were amplified as one piece from the pBAD HisB vector. The forward primer
introduced an XhoI site (5'-CTCGAGGCATGCATAATGTGCCTGTC-3') and the
reverse primer introduced a HindIII site
(5'-AAGCTTCTCCTGTTAGCCCAAAAAAACG-3'). The rbs was altered from its
original AGGAG to CTTCT. The following primer set was used to make these
changes and introduced an upstream BssHII site
(5'-GCGCGCGGACGAAAGTAAACCCACTGG-3') and a downstream HindIII site
(5'-AAGCTTATTCAGAAGGTTAGCCCAAAAAAACGGG-3'). The first 340 bases of
PAO1 lpxC were amplified from PAO1 genomic DNA. The forward primer introduced
a HindIII site (5'-AAGCTTATGATCAAACAACGCACCTT-3') and the reverse
primer introduced an XbaI site (5'-TCTAGAAGCGCTGCCATCCATGATCGG-3').
These pieces were then ligated into pPW101 to form the final product, pBEM10, which
was used for the 'promoter swap' mutagenesis of lpxC. The 'promoter swap' scheme is a
homologous recombination strategy, whereby transformation of pBEM10 into P.
aeruginosa removed the native lpxC promoter and placed the tightly regulated araBAD
promoter upstream of the chromosomal copy of lpxC, allowing modulation of its
expression by the use of a simple sugar, arabinose (Figure 3). In the absence of
arabinose, lpxC was effectively shut off, and expression was inducible by addition of
arabinose. Such mutants were selected in the presence of arabinose; if lpxC is
essential, these mutants would not be viable in medium that is not supplemented with
arabinose, but would be fully capable of growth in the presence of arabinose.

[0182] Growth curves. Bacterial cultures were prepared by diluting stationary-phase
overnight cultures to an OD600 of 0.1 in 5 ml of LB. The inhibitor, L-161,240, was
resuspended in DMSO to a final concentration of 10 mg/ml and added to the bacterial
cultures to a final concentration of 50 µg/ml or 10 µg/ml. In the samples without
inhibitor, DMSO was added to keep the final concentration of DMSO equivalent
between samples. The cultures were incubated with shaking and 0.8 ml was taken for
OD600 readings over the course of the experiment. DH5α, PAO1, and PAO200
(Schweizer, H. P. 1998. Intrinsic resistance to inhibitors of fatty acid biosynthesis in
Pseudomonas aeruginosa is due to efflux: application of a novel technique for
generation of unmarked chromosomal mutations for the study of efflux systems.
Antimicrob. Agents Chemother. 42: 394-398) were all grown at 37°C. In the cases
where temperature-sensitive JBK strains were being assayed, the cultures were grown
at 42°C for both the overnight and the time course cultures.
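The volumes implied by this paragraph follow from the usual dilution relation C1·V1 = C2·V2. The sketch below is a worked-arithmetic aid, not part of the described protocol; the function name is hypothetical and the small volume contributed by the added stock is neglected.

```python
def stock_volume_ul(stock_mg_per_ml: float, final_ug_per_ml: float, culture_ml: float) -> float:
    """Microliters of stock needed to reach the final concentration in the culture.

    Uses C1 * V1 = C2 * V2 and ignores the volume added by the stock itself.
    """
    stock_ug_per_ml = stock_mg_per_ml * 1000.0
    return final_ug_per_ml * culture_ml * 1000.0 / stock_ug_per_ml


high = stock_volume_ul(10.0, 50.0, 5.0)  # 10 mg/ml stock, 50 ug/ml final, 5 ml culture
low = stock_volume_ul(10.0, 10.0, 5.0)   # same stock, 10 ug/ml final
print(high, low)
# DMSO top-up needed so that every tube carries the same DMSO volume as the
# 50 ug/ml sample: the 10 ug/ml tube and the inhibitor-free tube.
print(high - low, high)
```

Under these assumptions a 5 ml culture receives 25 µl of stock for 50 µg/ml and 5 µl for 10 µg/ml, so the lower-dose and inhibitor-free tubes would be topped up with 20 µl and 25 µl of DMSO, respectively.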

[0183] Outer membrane permeabilization. Polymyxin B nonapeptide (Sigma) was
prepared as a suspension in DMSO at 3 mg/ml final concentration. Erythromycin and
tetracycline were resuspended in DMSO to final concentrations of 250 mg/ml and 125
mg/ml, respectively. L-161,240 was prepared as above in DMSO to a final
concentration of 10 mg/ml. These DMSO antibiotic solutions were individually added
to LB to the appropriate final concentration and mixed. Polymyxin B nonapeptide was
then added to the appropriate samples and mixed. DMSO was added to each sample to
keep the final concentration of DMSO equivalent between samples. A stationary-phase
overnight culture of PAO1 was added to each sample to bring the final concentration to
an OD600 of 0.1. Samples were removed for OD600 determinations every 1-2 hours for 6.5
hours and the data from these time points were plotted.

[0184] MIC determinations for 'promoter swapped' mutants. Single colonies of
DH5α, PAO1, and each promoter swap strain were picked and grown in LB at 37°C with
shaking for approximately 4 hours. Assuming that an OD600 reading of 1.0 is
equivalent to 10^? cells/ml, dilutions were made of all cultures to 5x10^? cells/ml. 200 µl
of each diluted culture was added to each well where a two-fold serial dilution of
inhibitor had been placed. The 96-well plates were incubated at 37°C overnight and
their OD600 determined using the Spectramax Plus (Molecular Devices) plate reader.
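Reading an MIC from such a plate can be sketched as follows. The top concentration, the OD600 readings, and the growth cutoff in this example are hypothetical values chosen for illustration; the text does not state the cutoff that was used, and the function names are introduced here.

```python
from typing import List, Optional, Sequence


def twofold_series(top: float, wells: int) -> List[float]:
    """Concentrations of a two-fold serial dilution starting at `top`."""
    return [top / (2 ** i) for i in range(wells)]


def read_mic(
    concentrations: Sequence[float],
    od600: Sequence[float],
    cutoff: float,
) -> Optional[float]:
    """Lowest concentration with OD600 below `cutoff`, walking down from the top.

    The walk stops at the first well that shows growth, so an isolated clear
    well below a turbid one is not counted. Returns None if even the highest
    concentration shows growth.
    """
    mic = None
    for conc, od in sorted(zip(concentrations, od600), reverse=True):
        if od >= cutoff:
            break
        mic = conc
    return mic


series = twofold_series(64.0, 8)                           # hypothetical, in ug/ml
readings = [0.04, 0.05, 0.05, 0.06, 0.90, 1.10, 1.20, 1.20]  # hypothetical OD600
print(series)
print(read_mic(series, readings, cutoff=0.1))
```

With these example readings the first four wells are clear, so the function reports the fourth concentration in the series as the MIC.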

[0185] To confirm the effect of the arabinose-sensitive promoter in regulating
lpxC expression in the swapped mutants, MIC determinations were performed as
above, except that arabinose was added to induce expression of the chromosomal locus
and override the effects of the plasmid-borne lpxC. In this case the stationary-phase
overnight bacterial culture was diluted to 5x10^? cells/ml in LB containing arabinose at
a final concentration of 0.2% (from a 20% stock made up in water).



RESULTS AND DISCUSSION: 

[0186] Homology between the E. coli and the P. aeruginosa LpxC enzymes. Using
protein analysis software, this study and others have compared the deduced amino acid
sequences of LpxC from both E. coli and P. aeruginosa (Hyland, S. A., S. S. Eveland,
and M. S. Anderson. 1997. Cloning, expression, and purification of UDP-3-O-acyl-GlcNAc
deacetylase from Pseudomonas aeruginosa: a metalloamidase of the lipid A
biosynthesis pathway. J. Bacteriol. 179 (6): 2029-2037). This comparison revealed
82% similarity and 57% identity shared between the two sequences. This homology
was found over the entire length of the protein sequence (data not shown). Significant
homology with other known acetyl- or acyltransferases was not found, suggesting that
LpxC is unique among acetyltransferases. The two proteins also share a total of five
fully conserved histidine residues that are presumed to be responsible for coordination
of the zinc metal cofactor. It was therefore expected that an inhibitor that functions
by chelating the metal cofactor away would affect both enzymes similarly.

[0187] LpxC is essential for growth in P. aeruginosa. Since the hydroxamate
inhibitor was effective in preventing growth of E. coli, but completely ineffective
against P. aeruginosa, there was a possibility that LpxC was not essential in P.
aeruginosa. This could be the result of the presence of another enzyme that catalyzed a
similar function. If that were the case, elimination of the LpxC function should be
possible without inhibiting bacterial growth. A thorough analysis of the P. aeruginosa
genome sequence revealed only one LpxC homologue. An attempt to disrupt the
function of this LpxC homologue was made by conjugating wild-type PAO1 with a
suicide vector (pEX18T) carrying lpxC whose BamHI-SalI fragment had been
replaced with a gentamicin cassette. However, P. aeruginosa null mutants could not be
established by this method. In several attempts a few gentamicin-resistant
trans-conjugants were obtained, but in all these cases allelic replacement of the chromosomal
lpxC by the defective copy had not occurred. Instead, a gene duplication had occurred,
placing the suicide vector and the disrupted copy next to the wild-type allele (data not
shown). This could be demonstrated by the carbenicillin resistance and sucrose
sensitivity acquired by these trans-conjugants, both of which are encoded on the suicide
vector. These data indicated a strong negative selection for the sought-after disruption
of lpxC, suggesting that lpxC is essential for growth. To confirm this, an experiment
was carried out whereby the trans-conjugants were transformed with either lpxC on a
low-copy replicating vector, or vector alone. In 100% of lpxC transformants, resolution
of the gene duplication, as demonstrated by the loss of carbenicillin resistance and
sucrose sensitivity, was observed, as opposed to no such resolution among those
transformed with vector alone. These results suggested that the wild-type genomic
allele could be disrupted if a functional copy was present on the transforming plasmid.

[0187] In another attempt at demonstrating the essentiality of LpxC in P. aeruginosa, the
'promoter swap' strategy described in the materials and methods was carried out.
'Promoter swapped' Pseudomonas mutants were fully capable of growth in the presence
of arabinose, when the arabinose-sensitive lpxC promoter was turned on, but completely
incapable of growth in the absence of this inducer. This further confirmed that in P.
aeruginosa, just as in E. coli, LpxC is essential for growth.

[0188] E. coli expressing LpxC from P. aeruginosa is more resistant to
L-161,240. The E. coli strain JBK-1/pKD6 contains the chromosomal lpxC gene disrupted
with a kan element and a wild type copy of E. coli lpxC on the temperature-sensitive
replicon pKD6. The strain was constructed as described by Sorensen et al., 1996. Since
lpxC is essential for growth, this strain is not viable at 42°C because the functional
copy is on the temperature-sensitive replicon. Transforming JBK-1/pKD6 with lpxC
from either E. coli or P. aeruginosa on a non-temperature-sensitive replicon (pKD19,
TetR) and selecting at 42°C produced transformants that were viable at 42°C,
tetracycline resistant, and kanamycin sensitive. This result indicated that lpxC from P.
aeruginosa could be expressed in the E. coli background, and was capable of
substituting for the missing chromosomal copy. An unexpected result was that whereas
the JBK-1 carrying the lpxC copy from E. coli was still sensitive to killing by a slightly
higher concentration of L-161,240, the JBK-1 carrying the lpxC copy from P.
aeruginosa was resistant to up to 50 µg/ml, about 50 times above the MIC of the wild
type organisms (data not shown). This suggested that the P. aeruginosa enzyme was
uniquely resistant to this inhibitor. It also implied that this resistance was the reason for
the failure to inhibit growth of P. aeruginosa, and not reduced permeability, efflux,
or modification of the drug by the pseudomonal enzymes. This, in turn, suggests that a
program designed to search for inhibitors of the pseudomonal enzyme should be based
on screening directly on that enzyme, and not on the surrogate enzyme from E. coli.

[0189] L-161,240 is not a substrate for the major drug efflux pump of P. aeruginosa.
The completed P. aeruginosa genome reveals genes for at least nine homologous,
multicomponent, multidrug efflux systems (Stover et al., 2000, Complete genome
sequence of Pseudomonas aeruginosa PAO1, an opportunistic pathogen. Nature 406:
959-64). However, the only one that is constitutively expressed to a high degree in
wild type strains is MexAB-OprM (Kohler, T., M. Michea-Hamzehpour, and U. Henze.
1997. Characterization of MexE-MexF-OprN, a positively regulated multidrug efflux
system of Pseudomonas aeruginosa. Mol. Microbiol. 23: 345-354). Therefore, mutants
of this efflux system can be used to evaluate the consequences of diminished efflux
pump activity. These mutants would be expected to be highly sensitive to a number of
antibiotics. Such a mutant, PAO200, has been isolated (Schweizer, 1998, supra), and
whereas it shows a higher level of sensitivity to a number of antibiotics (Westbrock-
Wadman, S., D. R. Sherman, M. J. Hickey, S. N. Coulter, Y. Q. Zhu, P. Warrener, L. Y.
Nguyen, R. M. Shawar, K. R. Folger, and C. K. Stover. 1999. Characterization of a
Pseudomonas aeruginosa Efflux Pump Contributing to Aminoglycoside
Impermeability. Antimicrobial Agents and Chemotherapy 43 (12): 2975-2983), it was
not more sensitive to L-161,240 (Figure 4). This suggests that this drug compound is
not a substrate for this efflux system in P. aeruginosa.

[0190] P. aeruginosa is not less permeable to L-161,240. Low permeability of the
outer membrane is a major contributing factor to the observed high levels of intrinsic
drug resistance in P. aeruginosa (Nikaido, H. 1998. The role of outer membrane and
efflux pumps in the resistance of gram-negative bacteria. Can we improve access?
Drug Resistance Updates 1: 93-98). This low permeability is due to the fact that P.
aeruginosa lacks the homolog of the relatively efficient, trimeric porins like OmpF. P.
aeruginosa has, instead, OprF, the OmpA homolog, which produces channels only
when it is folded into a rare conformation, and only a small fraction of these channels
occurs in the open conformation. As is usually the case with P. aeruginosa, it was
assumed that the reason L-161,240 was ineffective against P. aeruginosa was the lack
of permeability of the outer membrane to this inhibitor. Polymyxin B nonapeptide
(PMBN), a derivative of polymyxin B that lacks the fatty acid tail, is capable of binding
to the polyanionic LPS molecules and disrupting the bilayer structure, thus increasing
the permeability of the outer membrane. PMBN has been used this way to permeabilize
the outer membrane of many gram-negative bacteria (Vaara, M. and T. Vaara. 1983.
Sensitization of gram-negative bacteria to antibiotics and complement by a nontoxic
oligopeptide. Nature 303: 526-528), including P. aeruginosa (Viljanen, P. and M.
Vaara. 1984. Susceptibility of gram-negative bacteria to polymyxin B nonapeptide.
Antimicrob. Agents Chemother. 25: 701-705), and effectively sensitize them to lipophilic
antibiotics. Unlike the acylated polymyxin B, PMBN is not cidal. In order to determine
the effect of outer membrane exclusion of L-161,240, we exposed P. aeruginosa to
PMBN in combination with L-161,240, and with other lipophilic antibiotics as positive
controls. Whereas PMBN lowered the MIC of tetracycline for P. aeruginosa more than
16 fold, the sensitivity towards L-161,240 remained unchanged (Figure 5). This,
together with the E. coli expression data, indicated that permeability was not a major
factor causing the inability of L-161,240 to inhibit pseudomonal growth.

[0191] P. aeruginosa expressing only E. coli LpxC is more sensitive to L-161,240
than wild type. Using the 'promoter swap' technique as described in the methods, it
was possible to replace expression from the wild type chromosomal copy of P.
aeruginosa lpxC with expression solely from a plasmid-borne copy. For this
experiment, 'promoter swapped' P. aeruginosa was transformed with either vector
containing P. aeruginosa lpxC ("PA Swap #1") or vector containing E. coli lpxC ("PA
Swap #2"). The transformants were then exposed to various concentrations of
L-161,240 for MIC determination. Transformants expressing only the E. coli enzyme
were much more sensitive to the inhibitor than organisms expressing the P.
aeruginosa enzyme (Figure 6). The sensitivity of these transformants was
comparable with that seen in E. coli. Since the validity of this observation
relied on the un-induced arabinose-sensitive promoter to shut down expression from the
chromosomal copy of lpxC, it was necessary to demonstrate how effectively this
happens. To do that, MIC determinations were performed as above, except that
arabinose was added to induce expression of the chromosomal locus. For this
experiment stationary-phase overnight bacterial cultures were diluted to 5x10^ cells/ml
in LB containing 0.2% arabinose. In this case all the transformants, regardless of what
gene the vector contained, were resistant to killing due to the expression of the
chromosomal copy of P. aeruginosa lpxC. This confirmed that the P. aeruginosa
enzyme is intrinsically resistant to inhibition by this hydroxamate
inhibitor. It also confirmed that neither reduced uptake, efflux, nor modification of the
inhibitor plays a significant role in this observed resistance. Considering the very high
similarity between the two enzymes, this finding was not expected.

[0192] On further examination and analysis of existing data, however, it was possible to
recognize some inherent differences that might explain this finding. Whereas both
enzymes share five conserved histidine residues, the E. coli enzyme has two more
histidines that have no counterparts in the P. aeruginosa enzyme. This is an important
difference because these residues are probably involved in the metal cofactor
coordination. It was also observed earlier that whereas the E. coli enzyme is not
sensitive to EDTA, the P. aeruginosa enzyme was significantly inhibited by as little as
2 µM EDTA. Evidence that the E. coli enzyme is also a metalloenzyme is that the
envA1 mutant, in which one of the conserved histidines (His 19) is replaced by a
serine, is sensitive to EDTA. It was because of these observations that these
investigators suggested that the E. coli enzyme has a more stably bound metal than that
of the EnvA1 mutant protein, and thus it is less accessible to EDTA than the wild type
P. aeruginosa enzyme. These observations suggest that the histidine 'patch' that is
involved in the metal coordination is not similar between the two enzymes. It is
conceivable therefore that, since the inhibitor works by chelating the metal cofactor
away from the enzyme, each 'patch' has unique features that result in disparate
reactivities towards the inhibitor. It is also important to consider the findings of
Wyckoff et al., 1998. Hydrocarbon rulers in UDP-N-acetylglucosamine
acyltransferases. J. Biol. Chem. 273 (49): 32369-32372. These investigators found that
LpxA, the first enzyme of lipid A biosynthesis, is very selective for the length of its
acyl donor substrates. Whereas E. coli LpxA prefers R-3-hydroxymyristoyl-ACP to
R-3-hydroxydecanoyl-ACP, P. aeruginosa LpxA prefers the opposite. The products of the
LpxA reaction therefore differ in the carbon chain length of their lipid moieties
between the two bacteria. Since the product of the LpxA reaction is the substrate of the
LpxC reaction, this observation suggests that the two LpxCs would have substrate
binding pockets of different sizes to accommodate the different size substrates. That
would, in turn, suggest that inhibitors that have to occupy that active site would be
unique for each enzyme.

Examples 3-7: ispA, ispB, uppS, aroC, aroK, and metK

[0193] Several more candidate genes from the HTTIM gene database were tested for
essentiality using a single crossover knock-out strategy. The Pseudomonas genes
targeted for knocking out were ispA, ispB, uppS, metK, aroC, and aroK. To attempt
knock-outs, regions of about 300 bp were cloned into the vector pPW120. These
regions were selected so that known active site residues (or highly conserved residues
likely to be essential for enzyme function) would be separated after generation of a
single-crossover knock-out. The regions were (numbering from the start codon): ispA,
283-594; ispB, 319-610; uppS, 103-402; metK, 415-732; aroC, 385-684; aroK, 175-
375.
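
The fragment sizes implied by the coordinates listed above can be checked with a short calculation. The following is only a minimal sketch: it assumes the quoted positions are 1-based and inclusive (the text does not state the convention), and the names `REGIONS` and `fragment_length` are illustrative, not part of the disclosed method.

```python
# Knock-out regions cloned into pPW120, numbered from the start codon.
# Values are taken from the paragraph above; the 1-based inclusive
# convention is an assumption made for this sketch.
REGIONS = {
    "ispA": (283, 594),
    "ispB": (319, 610),
    "uppS": (103, 402),
    "metK": (415, 732),
    "aroC": (385, 684),
    "aroK": (175, 375),
}


def fragment_length(start: int, end: int) -> int:
    """Return the inclusive length in bp of a region start..end."""
    return end - start + 1


# Length of each cloned fragment, keyed by gene name.
lengths = {gene: fragment_length(*span) for gene, span in REGIONS.items()}

for gene, bp in lengths.items():
    print(f"{gene}: {bp} bp")
```

Under that assumption, five of the six fragments fall within roughly 20 bp of the stated 300 bp target, while the aroK fragment is shorter at about 200 bp.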

[0194] The pPW120 vector carries an E. coli origin of replication, but not a
Pseudomonas origin of replication, making it a suicide vector. It also carries an origin
of conjugal transfer and antibiotic resistance genes for tetracycline and ampicillin. An
E. coli donor strain (SM10) carrying the pPW120 knockout constructs was incubated
with Pseudomonas strain PAO1 to allow conjugal transfer, and recombinants were
selected by plating onto media containing tetracycline at 100 µg/mL and
chloramphenicol at 10 µg/mL. Pseudomonas recombinants will be resistant to this
antibiotic mixture while wild-type PAO1 and the E. coli donor strain will be sensitive.
Aromatic amino acid recombinants (aroC and aroK) were then tested for auxotrophy
by plating onto minimal media with and without phenylalanine, tryptophan, tyrosine,
and folic acid at 100 µg/mL while maintaining tetracycline selection. The genes ispB,
uppS and metK did not yield recombinants, demonstrating that they are essential genes
in all media conditions, while ispA yielded slow-growing recombinants (suggesting that
this gene may nevertheless be an "important" gene according to the invention).

[0195] For ispA, ispB, uppS, and metK, the conjugation procedure was also done in
the presence of the complementing plasmid pBAD/HisP. This plasmid has both E. coli
and Pseudomonas origins of replication, an antibiotic resistance gene for carbenicillin,
and an arabinose-inducible copy of the full-length wild-type gene. In this way,
recombinants with the chromosomal copies of ispA, ispB, uppS, and metK knocked out
could be isolated since the vector copy would provide complementation.

[0196] The genes ispB, uppS, and metK are novel with regard to P. aeruginosa. The
gene ispB (PA4569, ranging from 5116864 to 5117832 in the genome) has 67%
similarity/52% identity to IspB in E. coli, and was assigned to the function class
concerned with biosynthesis of cofactors, protein groups and carriers, and energy
metabolism, with a confidence level of 2. It is thought to be involved in the pathway of
ubiquinone biosynthesis.

[0197] The gene uppS (PA3652, ranging from 4091654 to 4090899), coding for
undecaprenyl pyrophosphate synthetase, has 69% similarity/57% identity to the uppS
gene in E. coli, and was assigned to the function class involved in biosynthesis of
cofactors, protein groups and carriers, cell wall and capsule, with a confidence level of
2. It is separated by one gene (cdsA) from dxr, which is involved in the synthesis of
isopentenyl diphosphate, a precursor of undecaprenol phosphate.

[0198] The gene metK (PA0546, ranging from 604896 to 603706) had never been
characterized in P. aeruginosa, although it is 82% similar/72% identical to MetK in E.
coli. The gene encodes methionine adenosyltransferase (AdoMet synthetase), which is
involved specifically in methionine metabolism, and was originally assigned to a
function class of amino acid biosynthesis and metabolism and central intermediate
metabolism with a confidence level of 2.
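
As a consistency check on the genome coordinates quoted for ispB, uppS and metK (the ispB coordinates are read here as 5116864 and 5117832), the sketch below derives each gene's length and orientation. It assumes the coordinates are inclusive and that a first coordinate larger than the second indicates a gene on the reverse strand; both are assumptions of this sketch, and `GENES` and `span_info` are hypothetical names introduced only for illustration.

```python
# (locus tag, first coordinate, second coordinate) as quoted in the text.
GENES = {
    "ispB": ("PA4569", 5116864, 5117832),
    "uppS": ("PA3652", 4091654, 4090899),
    "metK": ("PA0546", 604896, 603706),
}


def span_info(first: int, second: int) -> tuple[str, int]:
    """Return (strand, length in bp) for an inclusive coordinate pair.

    Assumption: descending coordinates denote the reverse ('-') strand.
    """
    strand = "+" if first <= second else "-"
    length = abs(second - first) + 1
    return strand, length


for gene, (locus, first, second) in GENES.items():
    strand, length = span_info(first, second)
    # A protein-coding span is expected to be a whole number of codons.
    print(f"{gene} ({locus}): strand {strand}, {length} bp, "
          f"{length // 3} codons, remainder {length % 3}")
```

Under these assumptions all three spans are whole multiples of three nucleotides, which is what would be expected for protein-coding sequences.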

Example 8: rrf

[0199] The essentiality of the P. aeruginosa rrf (PA3653, ranging from 4092227 to
4091670) gene was tested using the promoter swap methodology disclosed herein. The
N-terminus region (position 1-327) of the gene encoding the ribosome recycling factor
(frr) was cloned into the plasmid vector pBEM10. A single crossover was constructed
as described above for lpxC. Recombinants were unable to grow in the absence of
arabinose, confirming the essentiality of this gene. The rrf gene encodes ribosome
recycling factor, alternatively known as ribosome releasing factor, assigned to the
functional class pertaining to translation, post-translational modification and
degradation with a confidence level of 1. Although this gene was previously known in
Pseudomonas aeruginosa, confirming the essentiality of known genes using the
methods disclosed herein will reveal new utilities for such genes as targets for the
identification and design of new antibacterial drugs.

conserved hypothetical protein






PA0543 | 9946410 | hypothetical protein | |
PA0544 | 9946411 | hypothetical protein | |
PA0546 | 9946414 | methionine adenosyltransferase | metK |
PA0550 | 9946418 | conserved hypothetical protein | | ygbM
PA0552 | 994642C | phosphoglycerate kinase | |
PA0553 | 9946422 | hypothetical protein | |
PA0555 | 9946424 | fructose-1,6-bisphosphate aldolase | fda | fbaA, cbbA, cfxB
PA0559 | 9946428 | conserved hypothetical protein | | yhiN
PA0563 | 9946432 | conserved hypothetical protein | |
PA0565 | 9946434 | conserved hypothetical protein | |
PA0567 | 9946437 | conserved hypothetical protein | | yqaE
PA0570 | 9946440 | hypothetical protein | |
PA0571 | 9946441 | hypothetical protein | |
PA0574 | 9946444 | hypothetical protein | |
PA0578 | 9946449 | conserved hypothetical protein | |
PA0579 | 9946450 | 30S ribosomal protein S21 | rpsU |
PA0580 | 9946451 | O-sialoglycoprotein endopeptidase | gcp | mD
PA0582 | 9946453 | dihydroneopterin aldolase | folB |
PA0585 | 9946456 | hypothetical protein | |
PA0589 | 9946461 | conserved hypothetical protein | | pIdE
PA0591 | 9946463 | conserved hypothetical protein | | apaG
PA0593 | 9946465 | pyridoxal phosphate biosynthetic protein PdxA | pdxA |
PA0594 | 9946466 | peptidyl-prolyl cis-trans isomerase SurA | surA |
PA0595 | 9946467 | organic solvent tolerance protein OstA precursor | ostA | imp
PA0607 | 9946481 | ribulose-phosphate 3-epimerase | rpe | dod
PA0610 | 9946484 | transcriptional regulator PrtN | prtN |
PA0611 | 9946485 | transcriptional regulator PrtR | prtR |
PA0613 | 9946487 | hypothetical protein | |
PA0614 | 9946488 | hypothetical protein | |
PA0617 | 9946492 | probable bacteriophage protein | |
PA0626 | 9946501 | hypothetical protein | |
PA0627 | 9946502 | conserved hypothetical protein | |
PA0630 | 9946506 | hypothetical protein | |
PA0632 | 9946507 | hypothetical protein | |
PA0635 | 9946511 | hypothetical protein | |
PA0639 | 9946516 | conserved hypothetical protein | |
PA0642 | 9946518 | hypothetical protein | |
PA0643 | 9946519 | hypothetical protein | |
PA0644 | 9946520 | hypothetical protein | |
PA0646 | 9946523 | hypothetical protein | |
PA0647 | 9946524 | hypothetical protein | |
PA0648 | 9946525 | hypothetical protein | |
PA0652 | 9946529 | transcriptional regulator Vfr | vfr |
PA0653 | 9946530 | conserved hypothetical protein | | yhfA
PA0655 | 9946532 | hypothetical protein | |
PA0660 | 9946538 | hypothetical protein | |
PA0661 | 9946539 | conserved hypothetical protein | |
PA0665 | 9946543 | conserved hypothetical protein | | yadR
PA0678 | 9946558 | probable type II secretion system protein | hxcU |
PA0679 | 9946559 | hypothetical protein | hxcP |
PA0680 | 9946560 | probable type II secretion system protein | hxcV |
PA0684 | 9946564 | probable type II secretion system protein | hxcZ |
PA0686 | 9946566 | probable type II secretion system protein | hxcR |
PA0687 | 9946567 | probable type II secretion system protein | hxcS |
PA0689 | 994657C | hypothetical protein | |
PA0697 | 9946578 | hypothetical protein | |
PA0698 | 994658C | hypothetical protein | |
PA0700 | 9946582 | hypothetical protein | |
PA0702 | 994658£ | hypothetical protein | |
PA0704 | 9946587 | probable amidase | |
PA0705 | 9946588 | probable glycosyl transferase | migA |
PA0708 | 9946591 | probable transcriptional regulator | |
PA0709 | 9946592 | hypothetical protein | |
PA0710 | 9946593 | lactoylglutathione lyase | gloA2 |
PA0712 | 9946595 | hypothetical protein | |
PA0714 | 9946598 | hypothetical protein | |
PA0715 | 9946599 | hypothetical protein | |
PA0716 | 9946600 | hypothetical protein | |
PA0717 | 9946601 | hypothetical protein of bacteriophage Pf1 | |
PA0720 | 9946604 | helix destabilizing protein of bacteriophage Pf1 | |
PA0728 | 9946613 | probable bacteriophage integrase | |
PA0729 | 9946614 | hypothetical protein | |
PA0730 | 9946615 | probable transferase | |
PA0733 | 9946618 | probable pseudouridylate synthase | | rsuA
PA0734 | 9946619 | hypothetical protein | |
PA0738 | 9946624 | conserved hypothetical protein | |
PA0742 | 9946628 | hypothetical protein | |
PA0759 | 9946647 | conserved hypothetical protein | |
PA0763 | 9946651 | anti-sigma factor MucA | mucA |
PA0767 | 9946655 | GTP-binding protein LepA | lepA |
PA0768 | 9946656 | signal peptidase I | lepB | lep; SPASE I
PA0771 | 9946660 | GTP-binding protein Era | era |
PA0776 | 9946665 | hypothetical protein | |
PA0778 | 9946667 | hypothetical protein | |
PA0786 | 9946676 | probable transporter | |
PA0787 | 9946677 | hypothetical protein | |
PA0790 | 9946681 | hypothetical protein | |
PA0802 | 9946694 | hypothetical protein | |
PA0805 | 9946697 | hypothetical protein | |
PA0808 | 9946701 | hypothetical protein | |
PA0815 | 9946708 | probable transcriptional regulator | |
PA0818 | 9946712 | hypothetical protein | |
PA0820 | 9946714 | hypothetical protein | |
PA0822 | 9946716 | hypothetical protein | |
PA0825 | 9946719 | hypothetical protein | |
PA0826 | 9946720 | hypothetical protein | |
PA0829 | 9946723 | probable hydrolase | |
PA0837 | 9946732 | peptidyl-prolyl cis-trans isomerase SlyD | slyD |
PA0850 | 9946747 | hypothetical protein | |
PA0851 | 9946748 | hypothetical protein | |
PA0853 | 9946750 | probable oxidoreductase | |
PA0857 | 9946754 | morphogene protein BolA | bolA |
PA0862 | 9946760 | hypothetical protein | |
PA0867 | 9946765 | hypothetical protein | |
PA0868 | | conserved hypothetical protein | | yaeJ
PA0869 | 9946767 | D-alanyl-D-alanine-endopeptidase | pbpG |
PA0871 | 9946770 | pterin-4-alpha-carbinolamine dehydratase | phhB |
PA0874 | 9946773 | hypothetical protein | |
PA0880 | 9946779 | probable ring-cleaving dioxygenase | |
PA0894 | 9946794 | hypothetical protein | |
PA0900 | 9946801 | hypothetical protein | |
PA0903 | 9946804 | alanyl-tRNA synthetase | alaS | sya
PA0904
PA0905
PA0906



9946806 1 aspartate kinase alpha and beta chain. 



9946807 



llysC' 



!ask; akaB 



carbon storage regulator 



IcsrA 



iPA0908 
iPA0909 



iPA0913 



PA0922 



PA0927 



PA0932 



9946808 1 probable transcriptional regulator 



9946810 



994681 1 



lypothetical protein 



9946816 



probable Mg transporter MgtE 



9946825 



9946831 



lypothetical protein 



9946836 



PA0937 i 9946842 



cysteine synthase B 



conserved hypothetical protein 



lypothetical protein 



inngtE 



lactate dehydrogenase (fermentative) 



IdhA 



jldhP 



cysM 



PA0939 



PA0944 



PA0945 



PA0947 



PA0953 



9946844 



lypothetical protein 



9946849 



9946850 



hosphoribosylaminoimidazole synthetase jpurN 



hosphoribosylanninoimidazole synthetase 



9946853 



conserved hypothetical protein 



9946859 



probable thioredoxin 



helX 



PA0954 



9946860 



probable acylphosphatase 



PA0956 



9946862 



prolyl-tRNA synthetase 



proS 



PA0960 



9946867 



lypothetical protein 



PA0962 



PA0969 



PA0970 



9946869 



probable dna-binding stress protein 



9946876 



TolQ protein 



tolQ 



9946877 



ToIR protein 



toIR 



PA0971 



PAQg72 



PA0973 



PA0976 



9946878 



TolA protein 



tolA 



9946879 



TolB protein 



tolB 



9946880 



outer membrane protein OprL precursor 



oprL 



pal excC 



■ 9946884 



conserved hypothetical protein 



PA0978 



9946886 



conserved hypothetical protein 



PA0979 



PA0980 



9946887 



conserved hypothetical protein 



9946888 



lypothetical protein 



PA0981 



9946889 



lypothetical protein 



PA0983 



9946891 



conserved hypothetical protein 



PA0985 



9946893 



probable colicin-like toxin 



PA0986 



PA0990 



PA0991 



PA0993 



PA1000 



9946894 



conserved hypothetical protein 



9946899 



consen/ed hypothetical protein 



9946900 



hypothetical protein 



9946902 



9946909 



probable pili assembly chaperone 



lypothetical protein 



PA1006 



9946916 



conserved hypothetical protein 



.yd<L 



PA1008 



PA1010 



994691 



bacterioferritin comigratory protein 



bcp 



9946920 



dihydrodipicolinate synthase 



dapA 



PA1012 



9946922 



conserved hypothetical protein 



PA1013 



PA1021 



PA1026 



PA1035 



PA1038 



9946923 



phosphoribosylaminoimidazole-succinocarboxpurC 



9946932 



probable enoyl-CoA hydratase/lsomerase 



9946938 



9946948 



hypothetical protein 



hypothetical protein 



9946951 



hypothetical protein 



PA1039 



PA1049 



PA1055 



9946952 



conserved hypothetical protein 



ychJ 



9946963 



9946969 



pyridoxine 6'-phosphate oxidase 



conserved hypothetical protein 



pdxH 



phaC 



PA1063 



9946978 



hypothetical protein 



PA1068 



PA1076 



9946983 



9946992 



probable heat shock protein (hsp90 family) 



hypothetical protein 



PA1088 



9947004 



hypothetical protein
PA1089


9947005 ( 


conserved hypothetical protein 


f 1 
i — ! 


*PA1090 


9947006 1 


lypothetical protein 




I t f* 1 w W w • 

SPAIOQS 


9947012 1 


iVDothetical protein 


ifliS , 


IPA1098 


9947015 1 


wo-conriDonent sensor fleS j 1 


!PAiin2 


9947019 1 


laaellar motor switch protein FliG 


fliG • 


iPA'HOS 


9947022 1 


flaaellar protein FliJ -fJ 1 


IPA1106 


9947023 1 


hypothetical protein 






fpA1107 


9947025 i 


conserved hvDOthetical protein 


i i 


1 tr\ 1 1 w 1 

r A M 1 *t * 


9947032 


hypothetical protein 




D Ai *! 1 ft 


9947037 


hypothetical protein \ 




DA 1 -ion 


9947039 

1 www 


consen/ed hypothetical protein • 


yfiN 


-'AT 1 


Q947041 

W w" t w~ I 


probable peptide deformylase 


def, pdf, fms 


D A -i 'io*; 


9947044 

ww" # w~~ 


probable cobalamin biosynthetic protein 


cobB 


-'AT 1 M j 

-'A 1 1 


9947049 


probable fosfomycin resistance protein \ 






994.7054 


hypothetical protein \ 






■'AT 1 OO 


9Q47055 

v7w" / www 


conserved hypothetical protein ! 




yedU 


n A «t H OQ 

-'AT 1 oo 


0047058 


probable transcriptional regulator ! 






-'AT 140 


575?*+ / UDU 


probable transcriptional regulator 1 






D A H ylO 

-'A1 1 4y 


00/17071 


hypothetical protein 1 






-'Allol 


QQA7A7'^ 


pyocln S2 immunity protein 


imm2 




IK. A A C A 


QQA707R 


conserved hypothetical protein 






r> A. A A cc 


QQA7077 


ribonucleoside reductase, small chain , 


nrdB 




PA1156 


yy4- / Uf o 


ribonucleoside reductase, large chain 


nrdA 




PA1 1 57 


QQ>470ftO 

yy^ / uou 


prnhahle two-component response regulator 


1 




K A A CO 

PA1 1 59 


QQil70A9 


probable cold-shock protein . 




PA1160 


yy^/ uoo 


hvDothetical orotein ! 




K A A CO 


QQA70R'^ 


irrJnvl-riiaminoDimelate desuccinylase 


idapE 






QQA70ft7 

yy^ f uo f 


conserved hypothetical protein ! 




PA1 1 65 


QQil70Rft 


hypothetical protein ? 




PAnof 


QQA70Q1 
yy+ f v/57 1 


hypothetical protein ' 




PAnoo 


QQil7nQ9 


hypothetical protein ; 




DAi -170 


9947096 


cytochrome c-type protein NapC 


InapC 




DAI 17*5 
PA 1 I/O 


QQ4.7097 


cytochrome c-type protein NapB precursor 


^napB 




DA 'I -17^ 
P A 1 1 / D 


' QQJ.7inn 

r 5757*T / 1 \J\f 


ferredoxin protein NapF 


:napF 




DA-i *177 
PAT \ I I 


\ QQil7ini 


perlplasmic nitrate reductase protein NapE 


inapE 






9947108 


n4.HicarboxYlate transport protein 


idctA 




DAi iQO 

PA J 1 y O 


99471 19 

w»7*T# 1 Iw 


hypothetical protein * 




PA iZUo 


9947130 


hypothetical protein i 




rA 1 c\}*\ 


9947131 


conserved hypothetical protein ! 


yieF 


KA IZUD 


9947133 

WC7*T f 1 WW 


hypothetical protein ' 




PA1213 


994714' 


hypothetical protein 1 




PA1215 


9947143 


; hypothetical protein I ^ 




rA l<t ID 


57w*T/ 1 "rn 


hypothetical protein 




PA1917 


9947145 

W w 1 1 1 


probable 2-isopropylmalate synthase 




TA1219 


9947147 


' hypothetical protein 




PA1223 


9947152 


\ probable transcriptional regulator i 




PA1224 


9947151: 


\ probable NAD(P)H dehydrogenase j 




PA1228 


994715/ 


^ hypothetical protein ! 




PA1230 
PA1233 


9947155 
994716: 


) hypothetical protein ! 
I hypothetical protein 


\ 


PA1237 


994716" 


J orobable multidrua resistance efflux pump 




iPA1250 


994718' 


1 alkaline proteinase inhibitor AprI 


aprI
PA1261


9947193 f 


probable transcriptional regulator ' : 


.PA1269 


9947202! I 


DFobable transcriDtlonal regulator i . 


:PA1280 


9P472 1 4 ! hvpothetical protein 5 


(cobC 




iPA1285 




Drobable transcriptional regulator i : -j 


1pA1295 ^ 


9947231 ( 


conserved hypothetical protein \ 


iycgL 




2 1 # » 1 *• w V 

iPA1298 


9947234 ( 


conserved hypothetical protein * 


iyohL 




'PA1300 




nrobable siama-70 factor. ECF subfamily i 


: 

i 




1 


■PA1306 




nrobable HIT familv orotein \ ! I 


1 1 t 1 1 w W 

!PA1307 




conservea nypoineucai piuicni 




vafJ 




yy4/ ^oz 


proDaDie transcripiionai reguiaior 






r /A 1 1 t 


yy*ff zoy 


CytOCnronne O UDlCjUinoi OXlOase pruiein v-^yun- 


cyoE 






QO/l79R'l 

yy*t^zD 1 


hypothetical protein 






PA1'^9R ' 


yy4/ ZD / 


probable transcriptional regulator 






PAi'^'^l 1 


yy4 f Zfv 


conserved nypotneticai protein 




yp?" — 




9947ZOZ 


probable Dinding proiein componeni oi mdo uu 






PAi'^^n 


9947289 


hypothetical protein 








9947290 


conserved hypothetical protein 






nAIOOU 1 


9947291 


hypothetical protein 








9947293 


conserved hypothetical protein 








9947295 


hypothetical protein , 






OA*! 'i'^*^ 

r'Alooo 


9947297 


hypothetical protein ' 






DA '1 ItZQ 


99473Q0 


hypothetical protein 






OAi 'IRO 


9947304 


hypothetical protein 








9947306 


probable transmembrane sensor 






nA\ 1 ODD 


9947309 


hypothetical protein 






HAT ooy 


9947312 


hypothetical protein 






r AT O / U 


9947313 


hypothetical protein 






D Al '^T'i 


9947314 


hypothetical protein 






KAl o f Z 


9947315 


hypothetical protein 






r A'l O # O 


9947319 


erythronate-4-phosphate dehydrogenase 


pdxB 




n A \ Ot f 


9947321 


conserved hypothetical protein 




vhhY 

y"".f — 


PAi '^7ft 


9947322 


hypothetical protein 






PAI "^70 


994 /oZo 


nrnhahle short-chain dehydrogenase 






PAI *^Q4 


yy4/ o4u 


hypothetical protein ! 




PAI'^QT 

r r% 1 WW f 


994/ o4o 


nrobable two-comoonent response regulator \ 




PAI '^QR 


994 / o44 


hypothetical protein 






PA1il04 


994/ 001 


hypothetical protein 








994 ( oOC 


acetylpoiyamine aminohydrolase 


aphA 




PA149fi 


994 f O/O 


hypothetical protein 






iPA1497 

r /A l*t^ / 


y y*t / o / c 


hypothetical protein 






PA14'^1 
r /A 1 *TU 1 


yy^ / oou 


regulatory protein RsaL 


rsaL 




PAI 432 


9947381 


autoinducer synthesis protein LasI 


lasi 




PA1442 


9947393 


conserved hypothetical protein 




fliL 


PAI 447 


994739£ 


flagellar biosynthetic protein FliQ 


fliQ 




PA1454 


9947406 


) flagellar synthesis regulator FleN 






PA1456 


994740E 


1 two-component response regulator CheY 


CheY 




PA1461 


994741 J 


\ probable chemotaxis protein 




mote 


PA1462 


994741^ 


[ probable, plasmld partitioning protein 






PA1464 


994741"/ 


r nrobable Durine-bindina chemotaxis protein 


1 

; 


cheW ^ 


1PA1465 


994741 { 


J hypothetical protein 1 


{, — ' — — ^-i 


IPA1468 


994742' 


1 hypothetical protein 




1PA1472 


994742i 


> conserved hvpothetical protein 


1 
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"PA1475 9947428iheme exporter protein CcmA iccmA '#NAME? 

•PA1^76 9947a9Qihpmp PY^nrt^r protein CcmB :ccmB '0x110; cycW. helB___ _ 


•DAiZ77 QQ47431 


heme exoorter orotein CcmC ,ccmC .pfcytl cycZ heIC 


lOA^A7f^ i QQ47432 


hypothetical protein i jpfcyt2 ccmD cycX helD i 


iDA-iApn QQ47434 


cytochrome C-type bioqenesis protein Com F jccmF icycK; cell _ „ 


ioAi/iAi » QQ47435 


cytochrome C biogenesis protein CcmG IccmG *dsbE 


! D A -1 /I QO C1Q47436 


cytochrome C-type bioqenesis protein CcmH jccmH :ccl2 cycL 


irA14oo yy**/^HZ 


hypothetical protein ! _ _ 


IPA1489 9947443 


hypothetical protein * 


!PA1492 : 9947447 


hypothetical protein } - • 


iPA1496 i 9947451 


probable potassium channel • * J 


PA1504 9947460 


probable transcriptional regulator j ; j 


PA1508 j 9947464 


hypothetical protein 




PA1509 9947465


hypothetical protein 




PA1514 9947471


conserved hypothetical protein 


ybbT


PA1517 9947474


conserved hypothetical protein 




PA1518 9947475


conserved hypothetical protein 






PA1526 


9947484 


probable transcriptional regulator 






PA1528 


9947486 


cell division protein ZipA 


zipA 




PA1529 


9947487 


DNA ligase 


lig


dnaL ligA


PA1532 


9947490 


DNA polymerase subunits gamma and tau 


dnaX




PA1533 


9947491 


conserved hypothetical protein 






PA1535 


9947494 


probable acyl-CoA dehydrogenase 




1 


PA1539 


9947498 


hypothetical protein 






PA1540 


9947499 


conserved hypothetical protein 






PA1541


9947500 


probable drug efflux transporter 






PA1548 


9947508 


conserved hypothetical protein 




11 AO 


PA1551 


9947511


probable ferredoxin 




fixG


PA1555 


9947515 


probable cytochrome c 




fixP ccoP


PA1558 


9947519 


hypothetical protein 






PA1559 


9947520 


hypothetical protein 






PA1560 


9947521 


hypothetical protein




i 


PA1564 


9947526 


conserved hypothetical protein




1 


PA1568


9947530 


conserved hypothetical protein 






PA1571 


9947533 


hypothetical protein 






PA1581


9947544 


succinate dehydrogenase (C subunit) 


sdhC 


cybA


PA1582 


9947545 


succinate dehydrogenase (D subunit) 


sdhD 




PA1583 


9947546 


succinate dehydrogenase (A subunit) 


sdhA 




PA1584 


9947547 


succinate dehydrogenase (B subunit) 


sdhB 




PA1587 


9947550 


lipoamide dehydrogenase-glc


lpdG


lpdA


PA1588 


9947552 


succinyl-CoA synthetase beta chain 


sucC 




PA1589 


9947553 


succinyl-CoA synthetase alpha chain 


sucD 


J 


PA1591 


9947555 


hypothetical protein 




i 


PA1592 


9947556 


hypothetical protein 




"1 

_ ! 


PA1593 


9947557 


hypothetical protein 




i 


PA1594 


9947558 


hypothetical protein 






PA1595 


9947559 


hypothetical protein 






PA1610 


9947576 


beta-hydroxydecanoyl-ACP dehydrase 


fabA 




PA1618 


9947584 


conserved hypothetical protein 




ybdB


PA1619 


9947585 


probable transcriptional regulator 






PA1622 


9947589 


probable hydrolase


J 




PA1623 


9947590


conserved hypothetical protein
PA1624


9947591


hypothetical protein






PA1630


9947597


probable transcriptional regulator






PA1632


9947600


KdpF protein kdpF


PA1635


9947603


potassium-transporting ATPase, C chain kdpC jatKU


PA1638


9947606


conserved hypothetical protein





PA1641


9947610


hypothetical protein


PA1645


9947614


hypothetical protein


PA1657


9947627 


conserved hypothetical protein



PA1664 


9947635


hypothetical protein






PA1666


9947637 


hypothetical protein








9947645 


hypothetical protein 






PA1674


9947646 


GTP cyclohydrolase I precursor


folE2 




jrrK 1 D / O | 


9947647 


conserved hypothetical protein 






PA1676 


9947648 


hypothetical protein 






r A 1 Do / 


9947660 

WW~f WVp/W 

9947663 


spermidine synthase 

translocation protein in type III secretion 


speE 




PA1691 


9947664 

QQ47fi69 

5J57*T / Ww 


translocation protein in type III secretion 
translocation protein in type III secretion 


pscT 
pscO 




PA1698 
PA1701 


9947672 
9947676 


outer membrane protein PopN 

conserved hypothetical protein in type III secretion


popN 




PA1702 
PA1705 


9947676 
9947679 


conserved hypothetical protein in type III secretion
regulator in type III secretion


pcrG 


pcr4 


PA1710


9947684 


exoenzyme S synthesis protein C precursor 


exsC 




PA1711




hypothetical protein 

type III export protein PscE 


pscE 




PA1719 

D A *i 7on 
PA1720

PA1722 


QQ2l7fiQ4 

9947697 


type III export protein PscF
type III export protein PscG 
type III export protein PscI 


pscF 
pscG 
pscI




PA1732 
D A-i 7*a*a 

r/\ 1 f *fO 


9947708 
9947709

9947720 


conserved hypothetical protein 
conserved hypothetical protein 
hypothetical protein






PA1745 


9947722 


hypothetical protein




PA1747 


9947725 


hypothetical protein




1 f OU 


9947728 

^w"f « a—w 


phospho-2-dehydro-3-deoxyheptonate aldolase






9947735 


homoserine kinase 


thrH




1 / DO 


9947747 
9947752 


hypothetical protein

probable methyltransferase


menG 


PA1777


9947757 


outer membrane protein OprF precursor 


oprF






9947760 


assimilatory nitrite reductase small subunit 


nirD
nasA


nasE 
nasT 


PA1783 
PA1785 


994776^ 
9947766 


nitrate transporter 
conserved hypothetical protein




nasT 


PA1787 


9947766 


aconitate hydratase 2


acnB




PA1790 


9947772 


hypothetical protein





PA1792 


994777^ 


conserved hypothetical protein 


1 


ybbF


PA1794 


9947776 


glutaminyl-tRNA synthetase


glnS




PA1795 


9947777 


cysteinyl-tRNA synthetase


cysS 




PA1796 
PA1803 


9947776 
9947786 


5,10-methylene-tetrahydrofolate dehydrogenase
Lon protease 


folD
lon
rnhA


capR deg muc lopA 


PA1815 
PA1816 
PA1817
PA1820


994780( 
994780' 
994780: 
9947801 


ribonuclease H

DNA polymerase III, epsilon chain
hypothetical protein

sodium/proton antiporter NhaB


dnaQ
nhaB


mutD
9947811


hypothetical protein




9947816


hypothetical protein








PA1835


9947821


hypothetical protein








nr\ lOO f 


9947823 


hypothetical protein


r A 1 0*tU 


9947827 


hypothetical protein










9947829 


hypothetical protein










9947832 


hypothetical protein


PA1847


9947835 


conserved hypothetical protein


yhgI




9947840 


hypothetical protein





PA1855 


9947843 


hypothetical protein 






D A •I fiCQ 

rAiooy 


9947848 


probable transcriptional regulator


1 




PA1862 


9947851 molybdenum transport protein ModB


modB 




PA1867 


9947857 hypothetical protein




xphA


PA1869 


9947859 probable acyl carrier protein 




! 


PA1872 
PA1882 


9947862 hypothetical protein 
9947873 probable transporter 






PA1883 
PA1884


9947874 probable NADH-ubiquinone/plastoquinone oxidoreductase
9947875 probable transcriptional regulator






PA1885 
PA1892 


9947876 conserved hypothetical protein
9947884 hypothetical protein 






PA1894 
PA1895 
PA1896 
PA1897 
PA1899 


9947886 hypothetical protein 

9947887 hypothetical protein

9947888 hypothetical protein

9947889 hypothetical protein

9947892 probable phenazine biosynthesis protein 




phzA2 


PA1900
PA1905
PA1911 


9947893 probable phenazine biosynthesis protein 
9950429 probable pyridoxamine 5'-phosphate oxidase 
9947904 probable transmembrane sensor 

.1 ■ i-i t -re\ * i,-.^ crorr eMiKfomitw 




phzB2 
phzG2 


PA1912 
PA1914 
PA1917 


yy4/yuo 
9947907 
9947910 


probable sigma-70 factor, ECF subfamily
conserved hypothetical protein 
hypothetical protein 




hvn 


PA1924 
PA1925 
PA1928 

^Ai yob 

3 A -1 007 

-'Ai y o / 


9947918 
9947919 
9947923 
9947924 
9947932 
9947933 


hypothetical protein 
hypothetical protein 

ribosomal protein alanine acetyltransferase 
hypothetical protein 
hypothetical protein 
conserved hypothetical protein 


rimJ 




n A 4 no D 

PAiyoo 
PA1 yoy 

DA A OC'I 

PA1951
rA J y 
r^A lyoo 


9947934 
9947935 
9947949 
9947950 
9947953 


conserved hypothetical protein 
hypothetical protein 
hypothetical protein 
hypothetical protein 
hypothetical protein 






DAi Qf>P 

nAI yoD 


9947954 


hypothetical protein 




i 

■ ■ ' 




9947960


conserved hypothetical protein 






PA1963 


9947962 


hypothetical protein 






PA1965 


9947964 


hypothetical protein 






PA1967 


9947966 


hypothetical protein 






PA1968 


9947967 


hypothetical protein 






PA1974 


9947974 


hypothetical protein 


I 




PA1978 


9947978 


probable transcriptional regulator 




agmR 


PA1980 


9947980 


probable two-component response regulator 


i 


PA1985 


9947986 


pyrroloquinoline quinone biosynthesis protein A pqqA


f 


1 
PA1986


9947987


pyrroloquinoline quinone biosynthesis protein B


pqqB



PA1988 


9947989


pyrroloquinoline quinone biosynthesis protein D


pqqD




PA1994


9947996


hypothetical protein




PA1995


9947997


hypothetical protein





PA1996


9947998


peptidyl-prolyl cis-trans isomerase C1


ppiC1 




PA2001


9948003


acetyl-CoA acetyltransferase


atoB 




PA2002


9948004


conserved hypothetical protein




atoc 


PA2007


9948010 


maleylacetoacetate isomerase


maiA 


; 


PA2010


9948013 


probable transcriptional regulator




. 


PA2013


9948016 


probable enoyl-CoA hydratase/isomerase 




menB


PA2015


9948019 


probable acyl-CoA dehydrogenase 




1 


PA2016


9948020 


probable transcriptional regulator




: 


PA2017


9948021 


hypothetical protein 






PA2021 


9948025 


hypothetical protein




i 


PA2026 


9948031 


conserved hypothetical protein 




yfeH 


PA2029 


9948034 


hypothetical protein




1 


PA2031 


9948036 


hypothetical protein






PA2034 


9948039 


hypothetical protein






PA2037 


9948043 


hypothetical protein






PA2042 


9948048 


probable transporter (membrane subunit)




ygjU


PA2051 


9948058 


probable transmembrane sensor






PA2052 


9948059 


cyanate lyase


cynS 




PA2060 


9948068 


probable permease of ABC transporter






PA2062 


9948071 


probable pyridoxal-phosphate dependent enzyme






PA2066 


9948075 


hypothetical protein




i 


PA2071 


9948081 


elongation factor G


fusA2 




PA2073 


9948083 


probable transporter (membrane subunit)






PA2074 


9948084 


hypothetical protein






PA2075 


9948086 


hypothetical protein






PA2080 


9948091 


hypothetical protein






PA2088 


9948100 


hypothetical protein






PA2090 


9948102 


hypothetical protein







PA2092 


9948104 


probable MFS transporter 






PA2095 


9948108 


hypothetical protein 






PA2097 


9948110 


probable flavin-binding monooxygenase 






PA2101 


994811^ 


conserved hypothetical protein 






PA2103 


9948117 


probable molybdopterin biosynthesis protein K 




moeB 


PA2105 


9948119 


probable acetyltransferase 






PA2107 


9948121 


hypothetical protein 






PA2110 


994812-^ 


hypothetical protein 






PA2118 


9948133 


O6-methylguanine-DNA methyltransferase


ada 




PA2119 


9948134 


alcohol dehydrogenase (Zn-dependent)




adh


PA2120 


9948135 


hypothetical protein 






PA2123 


9948138 


probable transcriptional regulator 






PA2126 


9948142 


conserved hypothetical protein






PA2131 


994ol4/ 


hypothetical protein






PA2136 


9948153 


hypothetical protein






PA2142 


99481 5£ 


probable short-chain dehydrogenase




.yh^ 


PA2143 


99481 6C 


hypothetical protein






PA2146 


994816^ 


conserved hypothetical protein


i 


yciG


PA2149 


994816*; 


hypothetical protein


j 




PA2157 


99481 7J 


hypothetical protein


> 
t 
PA2161


9948180 


hypothetical protein






PA2166


9948186


hypothetical protein






PA2167


9948187 


hypothetical protein


PA2170


9948190 


hypothetical protein


PA2171 


9948191 


hypothetical protein 




i 


PA2174


9948194 


hypothetical protein






PA2175 


9948195 


hypothetical protein






PA2182


9948203 


hypothetical protein






PA2183


9948204 


hypothetical protein


5 nMZ 1 OH \ 


9948205 


conserved hypothetical protein






9948206 


hypothetical protein


r 1 OD 


9948207 


hypothetical protein




9948208 


hypothetical protein






9948211


conserved hypothetical protein 






D A O -i QO * 


9948214


conserved hypothetical protein 








9948218


probable transcriptional regulator 






DAO-1 Q7 


QQ4R91Q 


conserved hypothetical protein 




ycnB 


o Aoont; 


QQ4.fl998 


hypothetical protein 






D Aoonv 


QQ4ft9'^0 


hypothetical protein 






DAOO-I 


QG A f{0%A 


conserved hypothetical protein 










probable MFS transporter






PA2219




membrane protein OpdE 


opdE 




PA2220




probable transcriptional regulator 




oprR 


PA2221 


QOASkOA^ 

yy*joz*to 


conserved hypothetical protein 






PA2222 


QQilft947 

• yy*toz*f/ 


hypothetical protein 




O A OOOO 

PA2223 


QQAROAfK 

yy*foz**o 


hypothetical protein 






PA2224 


yy*fozH-v7 


hypothetical protein 








QQ4ft9<^n 


hypothetical protein 










hypothetical protein 






D A0007 




probable transcriptional regulator 








QQ4ft9'S'^ 


hypothetical protein 






DAOOOQ 


9948254 


conserved hypothetical protein 


yiiM 




9948259 


probable exopolysaccharide transporter 


1 


D AOO/IO 


994R268 


hypothetical protein 


( 


DAOOA'^ 


9948271 


hypothetical protein 


i 


n/SiCZO 1 


9948278 


hypothetical protein 


i 




9948280 


L-asparaginase I


ansA




9948284 


pyoverdine biosynthesis protein PvcD 


pvcD




9948285 


transcriptional regulator PtxR


ptxR


nA\ZZDU 


9948288 


hypothetical protein 








9948309 


conserved hypothetical protein




arsH 




9948312 


hypothetical protein 


i 




PA99ft/l 


9948314 


hypothetical protein 


\ 


i 


PA99Q9 


9948323 


hypothetical protein 






PA2293 


9948324 


hypothetical protein 






PA2294 


994832S 


probable ATP-binding component of ABC tran 


s 




PA2295 


9948326 


probable permease of ABC transporter






PA2297 


9948326 


probable ferredoxin






PA2298 


994832S 


probable oxidoreductase




\ ' 


PA2303 


9948335 


hypothetical protein



PA2311 


994834^ 


hypothetical protein



PA2316 


994834S 


probable transcriptional regulator


i 
PA2329


9948364 probable ATP-binding component of ABC trans




\ . j 


PA2331


9948366


hypothetical protein


PA2336


9948371


hypothetical protein






PA2338


9948374 probable binding protein component of ABC m




mtlE


PA2343


9948379 xylulose kinase mtlY




PA2347


9948384 hypothetical protein






PA2349


9948386 conserved hypothetical protein






1 


PA2351


9948388 


probable permease of ABC transporter



PA2365


9948403 


conserved hypothetical protein




i 

1 


PA2367


9948406 


hypothetical protein






PA2368


9948407 


hypothetical protein




PA2370


9948409 


hypothetical protein




PA2372 


9948411 


hypothetical protein 






PA2375 


9948414 


hypothetical protein






PA2383 


9948423 


probable transcriptional regulator






PA2391 


9948432 


probable outer membrane protein




PA2405 


9948449 


hypothetical protein 




PA2406 


9948460 


hypothetical protein




PA2411 


9948455 


probable thioesterase




PA2412 


9948456 


conserved hypothetical protein




PA2418 


9948463 


hypothetical protein 




PA2422 


9948467 


hypothetical protein




PA2427 


9948473 


hypothetical protein




PA2428 


9948474 


hypothetical protein




PA2429


9948475 


hypothetical protein 






PA2434 


9948480 


hypothetical protein






PA2436 


9948482 hypothetical protein






PA2441 


9948488 hypothetical protein 






PA2442 


9948489 glycine cleavage system protein T2 


gcvT2 




PA2446 


9948494 glycine cleavage system protein H2


gcvH2 




PA2451 


9948499 hypothetical protein






PA2453 


9948502 hypothetical protein






PA2455 


9948504 hypothetical protein




PA2456 


9948505 hypothetical protein




PA2459


9948508 hypothetical protein




PA2460 


9948509 hypothetical protein




PA2461 


9948510 hypothetical protein


i 




i 


PA2464 


9948514 hypothetical protein




PA2467 


9948517 probable transmembrane sensor




PA2469 


9948519 


probable transcriptional regulator 


1 




PA2473 


9948524 


probable glutathione S-transferase 


i 

f- 




PA2474 


9948525 


hypothetical protein 


\ 




PA2475 


9948526 


probable cytochrome P450


1 




PA2485 


9948537 


hypothetical protein


t 




PA2487 


g94853€ 


hypothetical protein


1 

1 




PA2490 


9948542 


conserved hypothetical protein




YdbB 


PA2491 


9948542 


probable oxidoreductase






PA2492 


994854^ 


transcriptional regulator MexT


mexT 




PA2496 


994864S 


hypothetical protein






PA2500 


994855J 


probable MFS transporter




cynX


PA2501 


994855^ 


hypothetical protein



PA2504 


9948551 


hypothetical protein
PA2507


yy**o0D 1 V. 


catechol 1,2-dioxygenase catA


PA2515


ftQ/«QC/2Q f 


cis-1,2-dihydroxycyclohexa-3,4-diene carboxylate dehydrogenase xylL


PA2517


yy^oo/ 1 I 


toluate 1,2-dioxygenase beta subunit xylY


PA2521


yy*fOO / D 


RND divalent metal cation efflux membrane fusion protein czcB


PA2536


yy*tooyo 


probable phosphatidate cytidylyltransferase 


ynbB


PA2538 


yy4ooyo 


hypothetical protein






PA2539


yy^^ooyo v 


conserved hypothetical protein


ynbD


PA2544


yy*foou^ 


hypothetical protein


PA2549


9y4oDUo < 


conserved hypothetical protein


/QiT i 


PA2551


9948610


probable transcriptional regulator






PA2552


9948611


probable acyl-CoA dehydrogenase




acdB 


PA2553


9948612 


probable acyl-CoA thiolase 






PA2554


9948613 


probable short-chain dehydrogenase 




PA2577


9948639 


probable transcriptional regulator 




PA2584


9948647 


CDP-diacylglycerol-glycerol-3-phosphate 3-phosphatidyltransferase


pgsA


PA2591


9948655 


probable transcriptional regulator 




PA2602 


9948667 


hypothetical protein





PA2605 


9948670 


conserved hypothetical protein 




yheN 


PA2606


9948671 


conserved hypothetical protein 




yheM 


PA2607 


9948672 


conserved hypothetical protein 




yccK 


PA2608 
PA2612 
PA2614 
PA2615 


9948673 
9948677 
9948680 
9948681 


seryl-tRNA synthetase 
cell division protein FtsK 


serS 

lolA 

ftsK 




PA2617
PA2619
PA2621


9948683 
9948685 
9948687 


leucyl/phenylalanyl-tRNA-protein transferase 
initiation factor 

conserved hypothetical protein 


aat 
infA 




PA2626 


9948693 


tRNA methyltransferase


trmU 


asuE 


PA2629 


9948696 


adenylosuccinate lyase 


purB 




PA2638 


9948706 


NADH dehydrogenase 1 chain B 


nuoB 




PA2641 
PA2645 
PA2646 


9948709 
9948714 
9948716 


NADH dehydrogenase 1 chain F 

NADH dehydrogenase 1 chain J
NADH dehydrogenase 1 chain K


nuoF 
nuoJ 
nuoK 


i 


PA2658 
PA2663 
PA2666 
PA2667 


9948728 
9948734 
9948737 
9948738 


hypothetical protein 
hypothetical protein 

probable 6-pyruvoyl tetrahydrobiopterin synthase
conserved hypothetical protein 




ptpS


PA2668
PA2673 


9948739 
9948744 


hypothetical protein 

probable type II secretion system protein
probable type II secretion system protein




hplV
hplU 


PA2674 
PA2675 


9948745 
9948746 


probable type II secretion system protein 




hplT


PA2678 
PA2681 
PA2683 


9948749 
9948753 
9948755 


probable permease of ABC-2 transporter
probable transcriptional regulator
probable serine/threonine dehydratase, degradative


cl 


tdcB 


PA2689 
PA2690 
PA2694 
PA2697 


9948762 
9948762 
9948767 
9948771 


hypothetical protein

probable transposase

probable thioredoxin
hypothetical protein 


1 




PA2703 
PA2706 
PA2715 
PA2719


994877"! 
9948780 
994879( 
g94879i 


hypothetical protein
hypothetical protein
probable ferredoxin
hypothetical protein


PA2720



9948796 



PA2721 



PA2722 



PA2723 



PA2726 



PA2730 



PA2731 



PA2733 



PA2734 



9948797 



9948798 



9948799 



9948802 



9948807 



hypothetical protein 



hypothetical protein 



hypothetical protein 



hypothetical protein 



probable radical activating enzyme 
hypothetical protein 



9948808 hypothetical protein



9948810 conserved hypothetical protein



9948811 hypothetical protein



PA2736 9948814 hypothetical protein



PA2737 



PA2738 



9948815 conserved hypothetical protein



9948816 integration host factor, alpha subunit



himA 



PA2739 9948817 phenylalanyl-tRNA synthetase, beta subunit



PA2740 



PA2741 



pheT 



9948818 phenylalanyl-tRNA synthetase, alpha-subunit

9948819 50S ribosomal protein L20 



pheS 



rplT



PA2742 



PA2743 



PA2744 
PA2749 



9948820 50S ribosomal protein L35
9948821 translation initiation factor IF-3



rpmI



infC 



9948822 threonyl-tRNA synthetase



thrS 



PA2753 



9948828 DNA-specific endonuclease I



endA 



9948832 hypothetical protein 



9948835 hypothetical protein



PA2759 



PA2762 



PA2763 



PA2767 



9948839 hypothetical protein



9948842 hypothetical protein 



9948843 hypothetical protein 



9948847 probable enoyl-CoA hydratase/isomerase 



9948848 hypothetical protein



9948849 hypothetical protein



PA2774 



9948855 hypothetical protein



9948856 hypothetical protein



PA2780 



9948861 hypothetical protein



9948862 hypothetical protein 



PA2782 



9948863 hypothetical protein 



9948866 hypothetical protein 



PA2785 



9948867 conserved hypothetical protein 



9948868 hypothetical protein 



PA2792 



PA2794 



PA2797 



PA2799 



9948874 hypothetical protein



9948877 hypothetical protein



9948880 hypothetical protein



9948882 hypothetical protein



9948883 conserved hypothetical protein



vacJ 



PA2803 



PA2805 



PA2807 



PA2808 



PA2811 



PA2818 



PA2819 



PA2827 



PA2829 



PA2831 



PA2832
PA2839



9948886 hypothetical protein



9948888 hypothetical protein



9948891 hypothetical protein



9948892 hypothetical protein



9948895 probable permease of ABC-2 transporter 



yadH 



9948902 hypothetical protein



9948903 hypothetical protein



9948912 conserved hypothetical protein



9948914 hypothetical protein



9948917 conserved hypothetical protein
9948918 thiopurine methyltransferase



9948925 conserved hypothetical protein



yeaA 



tpm 
PA2843



PA2845 



PA2851 



9948930 probable aldolase

9948932 hypothetical protein

9948938 translation elongation factor P



efp



PA2852 



9948939 hypothetical protein



PA2853 9948941 outer membrane lipoprotein OprI precursor oprI



PA2854 



9948942 conserved hypothetical protein



erfK



PA2859 



PA2863 



PA2868 



PA2874



PA2877 



PA2879 



PA2883 



9948947 transcription elongation factor GreB

9948951 lipase modulator protein

9948957 hypothetical protein



greB



lipH



9948963 hypothetical protein
QQiiRCifiR orotidine 5*-DhosDhS 



PA2876 9948966 orotidine 5'-phosphate decarboxylase



pyrF



9948967 probable transcriptional regulator



9948969 probable transcriptional regulator



hpkR 



9948973 hypothetical protein



PA2894 



9948985 hypothetical protein



PA2898 



9948990 hypothetical protein



PA2901 



PA2916 



PA2922 



PA2928



9948993 hypothetical protein 



PA2910 9949003 conserved hypothetical protein
PA2915 9949008 hypothetical protein



yebN 



9949010 hypothetical protein



9949016 probable hydrolase



9949023 hypothetical protein



PA2935 



PA2936 



PA2937 



9949030 hypothetical protein



9949031 hypothetical protein



9949033 hypothetical protein



PA2940 



PA2949 



9949036 probable acyl-CoA thiolase



9949046 probable lipase



PA2951 



PA2952 



9949048 electron transfer flavoprotein alpha-subunit



etfA 



9949049 electron transfer flavoprotein beta-subunit



etfB



PA2953 



PA2960 



9949050 electron transfer flavoprotein-ubiquinone oxidoreductase
9949058 type 4 fimbrial biogenesis protein PilZ



pilZ



PA2961 



PA2962 



PA2963 



PA2966 



9949059 DNA polymerase III, delta prime subunit



holB



9949060 thymidylate kinase 



tmk 



9949061 conserved hypothetical protein



9949064 acyl carrier protein



acpP 



yceG 



PA2967 



PA2968 



9949065 3-oxoacyl-[acyl-carrier-protein] reductase



fabG 



9949066 malonyl-CoA-[acyl-carrier-protein] transacylase fabD



PA2970 



9949069 50S ribosomal protein L32



rpmF 



PA2971 



9949070 conserved hypothetical protein



yceP 



PA2975 



PA2977 



PA2978 



PA2979 



PA2980 



PA2981 



9949074 ribosomal large subunit pseudouridine synthase



rluC



9949076 UDP-N-acetylpyruvoylglucosamine reductase



9949077 phosphotyrosine protein phosphatase



9949078 3-deoxy-manno-octulosonate cytidylyltransferase kdsB



9949079 conserved hypothetical protein



9949080 tetraacyldisaccharide 4'-kinase



murB 



yceC 



PtpA 



lpxK



ycaR 



PA2982 



9949081 conserved hypothetical protein



PA2983 



PA2985 



9949082 probable tolQ-type transport protein



9949085 hypothetical protein



PA2986 



9949086 conserved hypothetical protein



PA2987 



PA2988 



PA2989 



9949087 



9949088 



9949089 



probable ATP-binding component of ABC trang 



conserved hypothetical protein 



ycfV 



hypothetical protein
9949091


soluble pyridine nucleotide transhydrogenase sth




9949092


hypothetical protein




9949096


Na+-translocating NADH:ubiquinone oxidoreductase nqrD




9949102


probable glyceraldehyde-3-phosphate dehydrogenase








probable nucleoside phosphorylase





9949111


hypothetical protein



PA3011


9949113


DNA topoisomerase I topA




PA3012


9949114 


hypothetical protein



PA3017 


9949120


conserved hypothetical protein



PA3021 


9949124 


hypothetical protein




— . \ 


PA3022 


9949125 


hypothetical protein




j 


PA3024


9949127 


probable carbohydrate kinase




I 


PA?in?^n ; 


9949134 


probable molybdopterin-guanine dinucleotide b




PA3033 9949137


hypothetical protein






PA3036
PA3040 


9949140 
9949145 


hypothetical protein
conserved hypothetical protein




yqiD 
yqjE 


PA3041 
PA3042 
PA3046
PA3049 


9949146 
9949147 
9949151 
9949155 


hypothetical protein 
hypothetical protein 
conserved hypothetical protein
ribosome modulation factor


rmf 


p 


PA3051 
PA3067 
PA3081 


9949157 
9949174 
9949189 


hypothetical protein " 

probable transcriptional regulator 

conserved hypothetical protein 






PA3086 


9949193 


hypothetical protein 






PA3088 
PA3089 


9949197 
9949198 


conserved hypothetical protein 

hypothetical protein 




y^B , 


PA3093 
PA3095 


9949202 
994c7^UO 


hypothetical protein 

general secretion pathway protein M 


xcpZ 




PA3096 


9949206 


general secretion pathway protein L


xcpY 




PA3100 


9949210 


general secretion pathway protein H 


xcpU 


pddB


PA3103 
PA3110 


9949213 
9949221 


general secretion pathway protein E
hypothetical protein


xcpR 




PA3112 
PA3117 


9949223 
9949229 


acetyl-CoA carboxylase beta subunit 
aspartate semialdehyde dehydrogenase 


accD 
asd 




PA3123 


9949235 


conserved hypothetical protein 






PA3140 


9949254 


hypothetical protein






PA3142 


9949256 


hypothetical protein 






PA3144 


9949258 


hypothetical protein 






PA3145 


9949259 


glycosyltransferase WbpL 


wbpL 
wbpK




PA3146 
PA3147 


9949260 
9949261 


probable NAD-dependent epimerase/dehydratase
probable glycosyl transferase WbpJ


wbpJ 




PA3148 


9949262 


probable UDP-N-acetylglucosamine 2-epimerase


wbpI




PA3149 


9949262 


probable glycosyltransferase WbpH


wbpH 




PA3150 


994926^ 


LPS biosynthesis protein WbpG


wbpG 




PA3151 


9949266 


imidazoleglycerol-phosphate synthase, cyclase


hisF2




PA3152 


9949267 


glutamine amidotransferase


hisH2 




PA3153 


994926£ 


O-antigen translocase


wzx 


wbpF. rfbX 


PA3154 
PA3155 


994926£ 
994927( 


B-band O-antigen polymerase

probable aminotransferase WbpE


wzy 

wbpE 

wbpD 


rfc 


PA3156 
PA3157 
PA3158 


994927' 
1 994927: 
j . 994927: 


probable acetyltransferase WbpD 
probable acetyltransferase
probable oxidoreductase WpbB


wbpB


wbpC 
PA3159 9949274


probable UDP-glucose/GDP-mannose dehydrogenase wbpA


PA3160 9949276


O-antigen chain length regulator wzz cld, rol


PA3161


9949277 


integration host factor beta subunit himD


PA3162


9949278 


30S ribosomal protein S1 rpsA


PA3163


9949279 


cytidylate kinase cmk


PA3167 9949283


3-phosphoserine aminotransferase serC


PA3168 9949285


DNA gyrase subunit A gyrA


PA3171 9949288


3-demethylubiquinone-9 3-methyltransferase ubiG


PA3178 9949296


hypothetical protein


PA3181 9949299


2-keto-3-deoxy-6-phosphogluconate aldolase


edaA 


PA3185 9949303 


hypothetical protein 




PA3195


9949314 


glyceraldehyde 3-phosphate dehydrogenase 


gapA 




PA3202


9949322 


conserved hypothetical protein 




yciI


PA3203


9949323 


hypothetical protein 






PA3207 


9949327 


hypothetical protein 






PA3211 


9949331 


probable permease of ABC transporter 






PA3220 


9949341 


probable transcriptional regulator 






PA3227 


9949348 


peptidyl-prolyl cis-trans isomerase A


ppiA 


cypH


PA3230 


9949352 


conserved hypothetical protein 






PA3232 


9949354 


probable nuclease 






PA3237 


9949359 


hypothetical protein 






PA3242 


9949365 


probable lauroyl acyltransferase




htrB 


PA3245 


9949368 


cell division topological specificity factor MinE 


minE 




PA3246 


9949369 


pseudouridine synthase RluA 


rluA 


yabO 


PA3249 


9949372 


probable transcriptional regulator 






PA3255 


9949379 


hypothetical protein 






PA3260 


9949384 


probable transcriptional regulator 






PA3266 


9949391 


cold acclimation protein B 


capB 


cspA 


PA3273 


9949398 


hypothetical protein 






PA3274 


9949399 


hypothetical protein






PA3275 


9949401 


conserved hypothetical protein


ynfA 


PA3278 9949404 


hypothetical protein 






PA3280 


9949406 


outer membrane porin OprO precursor 


oprO 




PA3281 


9949407 


hypothetical protein 






PA3287 


9949414 


conserved hypothetical protein 






PA3288 


9949415 


hypothetical protein 






PA3289 


9949416 


hypothetical protein 






PA3291 


9949418 


hypothetical protein 






PA3292 


9949419 


hypothetical protein 






PA3298 


9949426 


hypothetical protein 






PA3302 


9949430 


conserved hypothetical protein 






PA3309 


9949438 


conserved hypothetical protein 






PA3312 


9949441 


probable 3-hydroxyisobutyrate dehydrogenase 






PA3314 


9949443


probable ATP-binding component of ABC tran 






PA3315 


9949444 


probable permease of ABC transporter 






PA3317 


9949447 


hypothetical protein 






PA3318 


9949448 


hypothetical protein






PA3320 


994945C 


hypothetical protein






PA3326 


9949457 


probable Clp-family ATP-dependent protease






PA3330 


9949461 


probable short chain dehydrogenase






PA3332


9949462 


conserved hypothetical protein




PA3334


9949465 probable acyl carrier protein


1 
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ii r\oooo ! 


9949470


hypothetical protein


j r r\00*T > 1 


9949473 


probable transcriptional regulator 








9949480 


hypothetical protein






f nr\oo*to J 


9949481 


probable chemotaxis protein methyltransferase


cheR1 




9949484 


hypothetical protein 


flgM




9949486


hypothetical protein 








9949487 


hypothetical protein 








9949494 


probable secretion protein 






■'MOOD 1 1 


9949502 


hypothetical protein


ydcA


-'AOODO 


9949503


probable acetyltransferase 




-'Ao370 1 




hypothetical protein


^Aoo71 




hypothetical protein






■'A33o0 




conserved hypothetical protein 




phnG 


PA33o4 




ATP-binding component of ABC phosphonate 


phnC 




PA3390


yy'fyo^o 


hypothetical protein 






^A339o 


yy*iyoo i 


NosY protein 


nosY 





PA339D 


yy*fyoo^ 


NosL protein 


nosL 




PA3397


yy*tyoo*t 


ferredoxin-NADP+ reductase 


fpr 




PA3403


yy*tyo*n/ 


hypothetical protein 






PA3407 


yy^yono 


heme acquisition protein HasAp 


hasAp 




PA3413 


fXQyt ACC->1 


conserved hypothetical protein 




yebG 


PA3414 


C\f\A QCCO 

994900^ 


hypothetical protein 






PA3416 


9949004 


probable pyruvate dehydrogenase E1 compon






PA3432 


9949 0 / 


hypothetical protein 






PA3433 


AA>1 

99490 / 0 


probable transcriptional regulator 




yv^bl 


PA3434 


A A y* ACTvl 

9949574 


probable transposase 






PA3435 


99490/0 


conserved hypothetical protein 






PA3438 


yy*iyOf D 


GTP cyclohydrolase I precursor


folE1 





PA3439 


yy^yo / y 


d-erythro-7,8-dihydroneopterin triphosphate


folX




PA3443 


yy*»yoo*t 


probable permease of ABC transporter 






PA3444 


yy*iyooo 


conserved hypothetical protein






n A A c 

PA3440 


yy^yooo 


conserved hypothetical protein




n A ^ A AC 

PA344D 


yy^yoo / 


conserved hypothetical protein


eeup 


n A ^ A cr\ 

PA3450 


yy'i-yoy 1 


probable antioxidant protein 


lsfA


PA3451 


QO/10RO0 
yy^yoyz 


hypothetical protein 






PA34dO 


yy^you^ 


probable acetyltransferase 






n A^ A "7 A 

PA347U 




hypothetical protein 






PA3472 


yy^yo id 


hypothetical protein 






PA3477 


QQAQfiOi 


transcriptional regulator RhlR


rhlR






QQ4Qfi97 


methionyl-tRNA synthetase 


metG 




nAo4oo ■ 




hypothetical protein 






KAo4oy 




conserved hypothetical protein 




rnfA


rAo4y^ 




conserved hypothetical protein 




rnfD 


rAOHy4 




conserved hypothetical protein 




rnfE


PA3495 


9949641 


endonuclease III 


nth 




PA3496 


9949642 


hypothetical protein 






PA3501 


9949647 


hypothetical protein 






PA3502 


9949641; 


hypothetical protein






PA3505 


9949652 


hypothetical protein






PA3512


994965S 


probable permease of ABC transporter






PA3519


9949667 


hypothetical protein




PA3520


9949668 


hypothetical protein
PA3523


9949671


probable RND efflux membrane fusion protein


PA3528 


9949677


ribonuclease T rnt




PA3530


9949679


conserved hypothetical protein


bfd


PA3533 


9949682


conserved hypothetical protein


ydhD


PA3542


9949693


alginate biosynthesis protein Alg44


alg44


PA3550


9949702


alginate o-acetyltransferase AlgF algF


PA3558


9949710 


hypothetical protein




PA3566


9949719


conserved hypothetical protein


ycnE




9949724 


methylmalonate-semialdehyde dehydrogenase


mmsA




9949726 


hypothetical protein




/AO %j i %J j 


9949729 


hypothetical protein







■'MOO / O 


9949732 


conserved hypothetical protein








9949744 



probable acyl-CoA thiolase








9949756 


conserved hypothetical protein 




IOC* 

rploo 


MODU 1 1 


9949757 


conserved hypothetical protein 




ykgM 




9949762 


hypothetical protein 








9949763 


conserved hypothetical protein 




yfiP


DA'^^nO 1 


9949768 



polyamine transport protein PotC 


potC 






9949768 



hypothetical protein 






MOD ij^ 


9949769 


conserved hypothetical protein 




vpeB 


^MOD ID 


9949774


conserved hypothetical protein 




recA 


-'Mod I / 


9949775


RecA protein 


recA 






QQ4Q7R5 


conserved hypothetical protein 




ygbB 


r'AoDO^ 


QQ4Q7Q1 


conserved hypothetical protein 




yedF 


-'AODOO 


QQ4Q7Q9 


conserved hypothetical protein 




ygbP 


r*AoDO*f 




conserved hypothetical protein 




ybgQ 


r^MODOO 


9949794


enolase 


eno 




■'MODOD 


9949795


2-dehydro-3-deoxyphosphooctonate aldolase 


kdsA




P A0C07 

■^MODO / 


9949796


CTP synthase 


pyrG 




pAOftQQ 

r^MODOO 


9949797


conserved hypothetical protein 






rMODO*? 


9949798 


acetyl-coenzyme A carboxylase carboxyl trans


accA 




PA^R^n 
r^MOO*Tw 


9949800 



DNA polymerase III alpha chain


dnaE


polC


PA^RZ*^ 


9949803 


lipid A-disaccharide synthase 


lpxB




"MOD***! 


9949804 



UDP-N-acetylglucosamine acyltransferase


lpxA




PA'^fiil'^ 
r^MOD*TO 


9949805 



(3R)-hydroxymyristoyl-[acyl carrier protein] del 


fabZ


sefA 




9949806 


UDP-3-O-[3-hydroxylauroyl] glucosamine N-ac


lpxD


firA omsA 


PA'^R47 

r MOO*T / 


9949807 


probable outer membrane protein precursor 






~MOO*tO 


9949808 


probable outer membrane protein 






PA365n 


994981' 


1-deoxy-d-xylulose 5-phosphate reductoisome


dxr


yaeM 


PA3651 


9949812 


phosphatidate cytidylyltransferase 


cdsA 






9949813 


undecaprenyl pyrophosphate synthetase 


uppS 


yaeS 




9949814 


ribosome recycling factor 


frr 


rrf 


PA3654 


9949815 


uridylate kinase 


pyrH 


smbA 


PA3655 


9949816 


elongation factor Ts 


tsf 




PA3656 


9949817 


30S ribosomal protein S2 


rpsB 




PA3657 


9949818 


methionine aminopeptidase 


map 




PA3662 


9949824 


hypothetical protein






PA3664 


9949826 


conserved hypothetical protein






PA3666 


9949828 


tetrahydrodipicolinate succinylase


dapD 




PA3671


9949832 


probable permease of ABC transporter




PA3674


9949837 


hypothetical protein




PA3678


9949841 


probable transcriptional regulator
PA3681



PA3684 



PA3685 



PA3688 



PA3693 



PA3701 



PA3725 



PA3730 9949897 hypothetical protein



PA3731 



PA3733 



9949844 hypothetical protein



9949848 



9949849 



9949852 



9949858 



hypothetical protein 



conserved hypothetical protein 



hypothetical protein 



conserved hypothetical protein 



peptide chain release factor 2 



PA3704 9949868 probable chemotaxis sensor/effector fusion pro



prfB



PA3719 9949885 hypothetical protein



9949891 single-stranded-DNA-specific exonuclease RecJ recJ



PA3726 9949892 conserved hypothetical protein



yaeQ 



9949898 conserved hypothetical protein



9949900 hypothetical protein 



PA3742 



PA3743 



9949910 50S ribosomal protein L19 



rplS



9949911 tRNA (guanine-N1)-methyltransferase



trmD
rimM 



PA3744 



9949912 16S rRNA processing protein



PA3745 



9949913 30S ribosomal protein S16



rpsP 
ffh 



PA3746 



PA3752 



PA3754 



9949914 signal recognition particle protein Ffh 



9949921 hypothetical protein



9949923 hypothetical protein



yeaB 



PA3756 



9949925 hypothetical protein



yafK 



PA3759 



9949928 probable aminotransferase



PA3765 



9949935 hypothetical protein



PA3767 



PA3769 



PA3773 



PA3776 



9949937 conserved hypothetical protein



9949940 GMP synthase



9949944 hypothetical protein



9949947 probable transcriptional regulator 



yfhC 



guaA 



PA3777 



9949948 exodeoxyribonuclease VII large subunit



xseA 



PA3782 



9949954 probable transcriptional regulator 



PA3784 



PA3785 



9949956 hypothetical protein 



9949957 conserved hypothetical protein



PA3787 



PA3788 



9949959 conserved hypothetical protein
9949960 hypothetical protein



PA3796 



PA3800 



PA3803 



PA3805 



PA3806 



PA3807 



9949969 hypothetical protein



9949973 conserved hypothetical protein



9949976 conserved hypothetical protein



9949978 type 4 fimbrial biogenesis protein PilF



9949979 conserved hypothetical protein



9949980 nucleoside diphosphate kinase



gcpE 



pilF 



yfgB 



ndk 



PA3808 



9949982 conserved hypothetical protein



yfhJ 



PA3809 



PA3810 



9949983 ferredoxin [2Fe-2S]



fdx2 



9949984 heat shock protein HscA 



hscA 



PA3811 



PA3812 



PA3813 



PA3815 



9949985 heat shock protein HscB



hscB 



9949986 probable iron-binding protein IscA 



iscA 



9949987 probable iron-binding protein IscU 



iscU 



9949989 conserved hypothetical protein



PA3821 



PA3822 



9949995 secretion protein SecD



secD



9949996 conserved hypothetical protein



yajC



PA3827 



PA3828



PA3829 
PA3833



9950002 conserved hypothetical protein



9950003 conserved hypothetical protein



9950004 



9950008 



hypothetical protein 



hypothetical protein
PA3834 9950009 valyl-tRNA synthetase

PA3835 9950010 hypothetical protein



PA3840


9950016 conserved hypothetical protein


ybiN




9950019 hypothetical protein




PA3850


9950027 hypothetical protein




pAOOC-1 


qQ«;n098 1 


hypothetical protein




/A005JM t 




hypothetical protein






yyouuoo i 


hypothetical protein






yyouuo*!- y 


conserved hypothetical protein




DAOQCQ 1 


yyouuor 


probable carboxylesterase 






A O OC*7 1 

AooD / 


yyouu*fD 


probable DNA invertase






PAOODO j 


yyouu4f 


hypothetical protein






PA3od9 i 


yyouu4o 


hypothetical protein 






^Aoo7o I 


9950000 


nitrite extrusion protein 2 






n A o o o /I ) 


9950064


hypothetical protein 






A o o 1 

-'ASS 86 1 


9950066


hypothetical protein 






^A388o 


9950069 


probable permease of ABC transporter 






PA3890 


9950071


probable permease of ABC transporter 






PA3891


9950072 


probable ATP-binding component of ABC tran






PA3892


9950073 


conserved hypothetical protein 






PA3904


9950087 


hypothetical protein






PA3905 


9950088 


hypothetical protein 






PA3906


9950089 


hypothetical protein 






PA3911 


9950094 


conserved hypothetical protein 




vhbT 


PA3916 


9950100 


molybdopterin converting factor, large subunit 


mOaC 




PA3917 


9950101 


molybdopterin converting factor, small subunit 


moaD 




PA3918 


9950102 


molybdopterin biosynthetic protein C


1 1 lUdw 




PA3936 


9950122 


probable permease of ABC taurine transporter




tauC 


PA3940 


9950127 


probable DNA binding protein 






^A39d0 


9950149 


hypothetical protein 






PA39o2 


99501 o1 


hypothetical protein 






n A once 


9950154 


probable transcriptional regulator 






PA39o7 


99501 oo 


hypothetical protein






PA39o9 


99501 OO 


conserved hypothetical protein 






PAoy/o 


990U1DO 


probable transcriptional regulator 








oQcru ft7 

yyouiof 


glutamate-1-semialdehyde 2,1-aminomutase


hemL 




PAoy/y 


oQcn«i 7n 


hypothetical protein 






D A OQO'l 


QQCn"170 

yyoul r 


conserved hypothetical protein 




vbeZ 


O AQQQO 

PAoyo^ 


yyou 1 / o 


conserved hypothetical protein 






O A O QQ/l 

PAoyo4 


9950175 


apolipoprotein N-acyltransferase 


lnt


cutE 


KAoyoo 


9950177 


hypothetical protein 








9950178 


leucyl-tRNA synthetase 


leuS 




PA3988 


9950179 


hypothetical protein 






PA3989 


9950180 


DNA polymerase III, delta subunit


holA




PA3990 


9950182 


conserved hypothetical protein




PA3993 


9950185 


probable transposase




PA3996 


9950188 


lipoate synthase


lipA


lip


PA3998 


99501 9C 


conserved hypothetical protein


ybeD 


PA4002 


9950194


rod shape-determining protein


rodA


mrdB 


PA4005 


9950197 


conserved hypothetical protein


ybeB 


PA4006


; 99501 9e 


hypothetical protein 


ybeN 


PA4008 


9950201 


probable hydrolase
PA4012
PA4018
PA4019
PA4028



PA4029



PA4031



PA4033 



9950205 hypothetical protein



9950211 hypothetical protein



9950212 probable aromatic acid decarboxylase
9950222 hypothetical protein



9950224 



conserved hypothetical protein 



9950226 



inorganic pyrophosphatase 



9950228 



hypothetical protein 



dedA
ipyR






PA4037 



PA4043 



9950232 



probable ATP-binding component of ABC tran



9950239 



geranyltranstransferase



ispA



PA4044 



9950240 



1-deoxyxylulose-5-phosphate synthase 



dxs 



PA4047 



PA4049 



9950243 



GTP cyclohydrolase II



ribA 



9950245 



hypothetical protein 



PA4050 



9950246 



phosphatidylglycerophosphatase A



PA4051 



PA4052 



9950247 



thiamine monophosphate kinase 



thiL 



9950248 



NusB protein



nusB 



ssyB 



PA4053 



PA4055 



9950249 



9950252 



6,7-dimethyl-8-ribityllumazine synthase



ribE 



riboflavin synthase alpha chain 



ribC 



ribH 
ribB 



PA4056 



PA4057 



PA4059 



9950253 



riboflavin-specific deaminase/reductase



ribD



ribG 



9950254 



conserved hypothetical protein 



ybaP 



9950256 



hypothetical protein



PA4060 



9950257 



hypothetical protein 



PA4063 



9950260 



hypothetical protein 



PA4064 



9950261 



probable ATP-binding component of ABC tran



PA4068 



PA4076 



PA4077 



9950266 



probable epimerase 



9950274 



hypothetical protein 



9950275 



probable transcriptional regulator 



PA4083 



9950282 



probable pili assembly chaperone 



PA4097 



PA4099 



PA4104 



PA4107 



9950298 



probable alcohol dehydrogenase (Zn-depende 



9950300 



hypothetical protein 



9950305 



conserved hypothetical protein 



9950309 



hypothetical protein 



PA4114 



PA4121 



9950317 



9950324 



spermidine acetyltransferase 



bltD 



conserved hypothetical protein 



PA4122 



9950325 



conserved hypothetical protein 



PA4125 



PA4134 



PA4141 



PA4149 



PA4151 



PA4157 



9950329 



5-carboxymethyl-2-hydroxymuconate isomerase hpcD



9950339 



9950346 



9950355 



conserved hypothetical protein 



9950357 



9950364 



hypothetical protein 



hypothetical protein 



acetoin catabolism protein AcoB 



acoX 



acoB 



probable transcriptional regulator 



PA4164 



PA4167 



PA4169 



PA4170 



PA4171 



PA4174 



PA4176 



PA4181 



9950372 



9950375 



hypothetical protein 



9950377 



conserved hypothetical protein 



9950378 



9950379 



9950383 



9950385 



9950390 



probable oxidoreductase 



hypothetical protein 



probable protease 



probable transcriptional regulator 



peptidyl-prolyl cis-trans isomerase C2 



ppiC2 



hypothetical protein 



PA4182 



PA4183 



PA4190 



PA4209 



9950391 



9950392 



hypothetical protein 



9950400 



hypothetical protein 



probable FAD-dependent monooxygenase 



9950421 



probable O-methyltransferase
PA4210
PA4211
PA4212



9950423 probable phenazine biosynthesis protein
9950424 probable phenazine biosynthesis protein
9950425 phenazine biosynthesis protein PhzC



phzA1



phzB1



PA4215 9950428 probable phenazine biosynthesis protein



PA4216



PA4219



9950430 probable pyridoxamine 5'-phosphate oxidase
9950433 hypothetical protein 



PA4230 9950446 salicylate biosynthesis protein PchB



PA4232
PA4237



PA4238 



PA4239 



PA4240 



PA4241 



pchB 



9950448 single-stranded DNA-binding protein
9950454 50S ribosomal protein L17 



ssb



rplQ



9950455 DNA-directed RNA polymerase alpha chain rpoA
9950456 30S ribosomal protein S4 rpsD



9950457 30S ribosomal protein S11 rpsK

9950458 30S ribosomal protein S13 rpsM



phzC1



phzF1
phzG1



PA4242 



PA4243 



9950459 50S ribosomal protein L36



rpmJ



9950460 secretion protein SecY



secY 



prlA 



PA4244 



PA4245 



9950461 50S ribosomal protein L15



9950462 50S ribosomal protein L30



rplO
rpmD



PA4246 



PA4247 



PA4248 



PA4249 



PA4250 



9950463 30S ribosomal protein S5



rpsE 



9950464 50S ribosomal protein L18



rplR



9950465 50S ribosomal protein L6



9950466 



rplF 



30S ribosomal protein S8



rpsH 



9950467 30S ribosomal protein S14



rpsN



PA4251 



9950468 



50S ribosomal protein L5



rplE



PA4252 



PA4253 



9950469 



50S ribosomal protein L24



rplX



9950470 



50S ribosomal protein L14



rplN



PA4254 



PA4255 



PA4256 



9950471 



30S ribosomal protein S17



rpsQ 



9950472 



50S ribosomal protein L29



rpmC 



9950473 



50S ribosomal protein L16



rplP



PA4257 



PA4258 



PA4259 



9950474 



30S ribosomal protein S3



rpsC 



9950475 



50S ribosomal protein L22



rplV



9950476 



30S ribosomal protein S19



rpsS



PA4260 



PA4261 



PA4262 



PA4263 



PA4264 



PA4267 



9950477 



50S ribosomal protein L2



rplB 



9950478 



50S ribosomal protein L23



rplW



9950479 



50S ribosomal protein L4



9950480 



50S ribosomal protein L3



Wo 

imi: 



9950482 



30S ribosomal protein S10



rpsJ 



9950485 



30S ribosomal protein S7



rpsG



PA4268 



9950486 



30S ribosomal protein S12



rpsL



str 



PA4269 



PA4270 



9950487 



DNA-directed RNA polymerase beta' chain rpoC



9950488 



DNA-directed RNA polymerase beta chain 



rpoB 



PA4271 



PA4272 



9950490 



50S ribosomal protein L7 / L12



rplL



9950491 



50S ribosomal protein L10



rplJ 



PA4273 



9950492 



50S ribosomal protein L1



rplA 



PA4274 



PA4275 



PA4276 



PA4279 



PA4295 



PA4296 



9950493 



50S ribosomal protein L11



9950494 



transcription antitermination protein NusG 



rplK 



nusG 



9950495 



9950498 



secretion protein SecE 



secE 



hypothetical protein 



9950516 



9950518 



hypothetical protein 



probable two-component response regulator 



priG 



PA4298 



PA4299 



9950520 



9950521 



hypothetical protein 



hypothetical protein 



PA4305



9950527 



hypothetical protein
PA4306


9950529


hypothetical protein




PA4314


9950538


formyltetrahydrofolate deformylase


purU1




PA4322


9950546


conserved hypothetical protein




PA4324


9950548


hypothetical protein




PA4329


9950554


pyruvate kinase II


pykA


pyk-II


PA4330


9950555 


probable enoyl-CoA hydratase/isomerase




PA4341


9950567 


probable transcriptional regulator


PA4345 


9950572 


hypothetical protein




PA4348 


9950575 


conserved hypothetical protein 




PA4349


9950576 


hypothetical protein 






PA4350


9950577


conserved hypothetical protein 






PA4354


9950581 


conserved hypothetical protein 








9950584 


conserved hypothetical protein 




YhqG 




9950586 


conserved hypothetical protein 




feoA 




9950594 


superoxide dismutase 


sodB 






9950602 


hypothetical protein 








9950607 


hypothetical protein 








9950613


conserved hypothetical protein 




crcB 




9950615 


GroEL protein 


groEL 


mopA 




9950616 


GroES protein 


groES


mopB 




9950618 


hypothetical protein 






DA/l'^ftQ 


9950619 


probable short-chain dehydrogenase






D A/1 QQO 


9950623 


conserved hypothetical protein 




vbaZ 


DA^'^Q^ 


9950626 


conserved hypothetical protein 




va|Q 


DA/iiin'^ 


9950635 


secretion protein SecA 


secA 








hypothetical protein 








UDP-3-O-acyl-N-acetylglucosamine deacetyla 


lpxC


envA asmB


DA/lAn7 


9950639 


cell division protein FtsZ 


ftsZ 






9950640 


cell division protein FtsA 


ftsA 






9950641 


cell division protein FtsQ 


ftsQ 




DAA^i 1 


9950643 


UDP-N-acetylmuramate--alanine ligase


murC 




PA4id19 


9950644 


UDP-N-acetylglucosamine--N-acetylmuramyl-


murG




PA4413


9950645 


cell division protein FtsW 


ftsW 






9950646 


UDP-N-acetylmuramoylalanine--D-glutamate


murD 




PA4415


9950647 


phospho-N-acetylmuramoyl-pentapeptide-tran


mraY


ORF Y 


PA4416


9950648 


UDP-N-acetylmuramoylalanyl-D-glutamyl-2,


murF 




PA4417 


9950649 


UDP-N-acetylmuramoylalanyl-D-glutamate-2,


murE






9950650 


penicillin-binding protein 3 


ftsI


pbpB 




9950651 


cell division protein FtsL 


ftsL 






9950652 


conserved hypothetical protein 




mraW yabC ylxA 


PA4421 


9950653


conserved hypothetical protein 




vabB 


PA4424 


9950657 


conserved hypothetical protein 




yraN 


PA4425 


9950658 


probable phosphoheptose isomerase 




vraO \ 


PA4427 


9950660 


stringent starvation protein B 


sspB 




PA4428 


9950661 


stringent starvation protein A 


sspA 


ssppog . 


PA4430 


9950663 


probable cytochrome b






PA4432 


gg5066£ 


> 30S ribosomal protein S9 


rpsI




PA4433 


9950666 


50S ribosomal protein L13


rplM 




PA4436 


995067C 


probable transcriptional regulator






PA4438 


9950672 


conserved hypothetical protein




yhcM


PA4439


995067c 


tryptophanyl-tRNA synthetase


trpS




PA4440


9950674 hypothetical protein
PA4450


QQ«;nfiRR '. 1 inP-M-arefYinii ir.nsamine 1 -carboxwinvwang murA 




PA4452
PA4453


9950687


conserved hypothetical protein






9950688


conserved hypothetical protein






PA4454


9950689


conserved hypothetical protein 


yrbD


PA4455


9950690


probable permease of ABC transporter


yrbE


PA4457


9950693 conserved hypothetical protein


yrbH kpsF


PA4459 


9950695 conserved hypothetical protein




yrbK


PA4460 


9950698


conserved hypothetical protein




yhbN


PA4461 


9950697 


probable ATP-binding component of ABC tran




yiiuw * 


PA4462


9950698 


RNA polymerase sigma-54 factor


rpoN


ntrA 


PA4463


9950699 


conserved hypothetical protein 




y 1 iun 


fSAAARA. 


9950700 


nitrogen regulatory IIA protein 






PA4466


9950702 


probable phosphoryl carrier protein 






PA4471


9950707 


hypothetical protein 






PA4480 


9950718 


rod shape-determining protein MreC 


mreC 




PA4481 


9950719 


rod shape-determining protein MreB 


mreB 


envD roQ t 


PA4482 


9950720 


Glu-tRNA(Gln) amidotransferase subunit C


gatC




PA4483 


9950721 


Glu-tRNA(Gln) amidotransferase subunit A 


gatA




PA4484 


9950722 


Glu-tRNA(Gln) amidotransferase subunit B 


gatB




PA4485 
PA4492 


9950723 
9950730 


conserved hypothetical protein

conserved hypothetical protein 






PA4499 


9950738 


probable transcriptional regulator 






PA4507 


9950747 


hypothetical protein 






PA4524


9950766 


nicotinate-nucleotide pyrophosphorylase 


nadC




PA4525 


9950767 


type 4 fimbrial precursor PilA


pilA




PA4526 


9950768 


type 4 fimbrial biogenesis protein PilB 


pilB




PA4527 


9950770 


still frameshift type 4 fimbrial biogenesis protej 


pilC




PA4529


9950772 


conserved hypothetical protein






PA4530 


9950773 


conserved hypothetical protein 






PA4537 


9950780 


hypothetical protein 






PA4544 


9950788 


pseudouridine synthase 


nuu 




PA4547 


9950791 


two-component response regulator PilR


pilR




PA4552 


9950797 


type 4 fimbrial biogenesis protein PilW 


pilW




PA4553 


9950798 


type 4 fimbrial biogenesis protein PilX 


pilX 




PA4557 


9950802 


LytB protein 


lytB




PA4559 


9950804 


prolipoprotein signal peptidase 


lspA




PA4560 


9950805 


Isoleucyl-tRNA synthetase 






PA4561 




riboflavin kinase/FAD synthase 


ribF 




PA4563 


9950809 


30S ribosomal protein S20 


rpsT 




PA4564 


9950810 


conserved hypothetical protein




creA 


PA4565 


9950811 


glutamate 5-kinase 


proB 




PA4566 


9950812 


GTP-binding protein Obg


obg 




PA4567 


9950813


50S ribosomal protein L27


rpmA




PA4568 


9950814 


50S ribosomal protein L21


rplU 






9950815 


octaprenyl-diphosphate synthase


ispB 


eel 


PA4574 


9950821 


conserved hypothetical protein 




yqhA 


PA4575 
PA4577 


9950822 
995082^ 


hypothetical protein
hypothetical protein






PA4586 
PA4600 


995083^ 
995085( 


hypothetical protein

transcriptional regulator NfxB


nfxB 




PA4603 


995085; 


hypothetical protein




PA4610 


995086' 


hypothetical protein


PA4611


9950862


hypothetical protein 




PA4617


9950868


conserved hypothetical protein





PA4630


9950883


hypothetical protein




PA4636


9950890


hypothetical protein




PA4637


9950891


hypothetical protein




PA4638


9950892


hypothetical protein




PA4642 


9950895


hypothetical protein




PA4644


9950897


hypothetical protein




PA4646


9950899


uracil phosphoribosyltransferase


upp






9950903


hypothetical protein 








9950905 


probable pili assembly chaperone








9950909 


Ferrochelatase 




visA


DAZftR9 1 


9950917 


glutamate racemase


murI




nM*TDOO i 


9950918 


molybdopterin biosynthesis MoeB protein


moeB 


chlN


DA/lftfi'^ i 


9950920 


peptide chain release factor 1 


prfA


rf1 


DAAf^f^f^ 


9950921 


glutamyl-tRNA reductase 


hemA 


hem1; glutR 




9950923 


probable lipoprotein localization protein LolB 




lolB


PAARRQ 


9950924 


isopentenyl monophosphate kinase 


ipk 


ychB


DA^ft7n 


9950925 


ribose-phosphate pyrophosphokinase 


prs 


prsA


DAARTI 


9950927 



probable ribosomal protein L25 




rplY


P A4D / 




peptidyl-tRNA hydrolase




pth


r A4d f 4 




conserved hypothetical protein 




vapl i 


r A4o / D 




probable carbonic anhydrase




yadF


^^40 / y 


9950935


hypothetical protein 






rA4DO 1 




hypothetical protein 







PA4oSjo 




phosphatidylserine synthase 


pssA 




D A ylCQft 

KA4DoD 


9950953


acetolactate synthase large subunit 


ilvI




rA4Dy / 


9950955


hypothetical protein 






nA40yo 


9950956 


hypothetical protein 




yqcC 


DA^AQQ 


9950957 


hypothetical protein 






PAA7n9 


9950960 


hypothetical protein 






PA470R 


9950964 


probable ATP-binding component of ABC tran




phuV


PAA7i1 


9950970 


hypothetical protein 






l^rVH/ i 1 

PA47'1 A 


9950977 


hypothetical protein 






PA479ft 


9950988 



2-amino-4-hydroxy-6-hydroxymethyldihydropteridine


folK 




PAZ179Q 


9950989 



3-methyl-2-oxobutanoate hydroxymethyltransfer


panB




PA/17'^1 


9950992 



aspartate 1-decarboxylase precursor


panD 




iDA47'^9 


9950993 



glucose-6-phosphate isomerase






PA47'^7 


9950998 


hypothetical protein 






PA473a 


9950999 


conserved hypothetical protein 




vibJ ! 


PA4739 


9951000


conserved hypothetical protein






PA4740 


9951002 


polyribonucleotide nucleotidyltransferase






PA4741 


9951002 


30S ribosomal protein S15


rpsO 




PA4744 


9951006 


translation initiation factor IF-2


infB 




PA4745 


9951007 


N utilization substance protein A


nusA 




PA4746 


9951 00^ 


conserved hypothetical protein




ynoLr 


PA4747 


9951 OOS 


secretion protein SecG


secG 




PA4748 


. 9951 OK 


triosephosphate isomerase


tpiA 


tpi 


PA4749 


99510V 


phosphoglucosamine mutase


glmM


yhbF, mrsA 


PA4750 


995101: 


dihydropteroate synthase


folP 


dhpS 


PA4752 


995101i 


cell division protein FtsJ


ftsJ 




PA4753


995101( 


conserved hypothetical protein


yhbY
PA4757
PA4759
PA4762
PA4764
PA4765



PA4767
PA4773



PA4776 



PA4778 



PA4782 



9951020 



9951022 



9951026 



9951028 



conserved hypothetical protein 



dihydrodipicolinate reductase 



heat shock protein GrpE



dapB



'yeaS 



ferric uptake regulation protein



9951029 outer membrane lipoprotein OmlA



fur 



omlA



9951031 conserved hypothetical protein 



9951038 



9951041 



9951043 



9951047 



joprX 



hypothetical protein 



probable two-component response regulator 



probable transcriptional regulator 



jybbl 






hypothetical protein 



PA4788 



PA4789 



PA4790 



PA4792 



9951054 



9951055 



hypothetical protein 



conserved hypothetical protein 



9951056 



9951058 



conserved hypothetical protein 



conserved hypothetical protein 



smtA 



PA4797 



9951064 



probable transposase 



PA4802 



PA4809 



PA4813 



PA4823 



9951069 



hypothetical protein 



9951077 



9951080



FdhE protein



lipase LipC 



9951091 



hypothetical protein 



fdhE 



lipC 



PA4826 



9951095 



hypothetical protein 



PA4828 



9951097 



conserved hypothetical protein 



PA4831 



9951100 



probable transcriptional regulator 



PA4841 



9951111 



conserved hypothetical protein 



PA4847 



PA4848 



PA4850 



9951118 



biotin carboxyl carrier protein (BCCP)



accB 



fabE 



9951119 



9951121 



biotin carboxylase 



accC 



ribosomal protein L11 methyltransferase



prmA 



PA4853 



9951124 



DNA-binding protein Fis 



fis 



PA4861 



9951133 



probable ATP-binding component of ABC tran



PA4864 



9951137 urease accessory protein 



ureD 



PA4866 



PA4868 



PA4870 



PA4871 



9951139 



conserved hypothetical protein 



9951141 



urease alpha subunit 



ureC 



9951143 



conserved hypothetical protein 



9951144 



hypothetical protein 



ybil 



PA4874 



PA4875 



9951148 



conserved hypothetical protein 



psIF 



9951149 



hypothetical protein 



PA4878 



9951152 



probable transcriptional regulator



PA4885 



9951159 



two-component response regulator 



irlR



PA4887 



PA4890 



9951161



probable MFS transporter



9951165 



conserved hypothetical protein 



yijC



PA4892 



PA4894 



9951167 urease accessory protein UreF



ureF 



9951169 hypothetical protein



PA4895 



PA4906 



PA4908 



PA4916 



9951170 probable transmembrane sensor



9951182 probable transcriptional regulator



9951185 hypothetical protein



9951193 hypothetical protein



PA4920 



PA4923



PA4925 



PA4926 



9951198 NH3-dependent NAD synthetase



nadE 



9951201 conserved hypothetical protein



9951203 conserved hypothetical protein



9951204 conserved hypothetical protein



PA4931 



PA4934
PA4935



9951210 replicative DNA helicase



dnaB



9951213 30S ribosomal protein S18
9951214 30S ribosomal protein S6



rpsR



rpsF
PA4938


9951218


adenylosuccinate synthetase purA


PA4940


9951220


conserved hypothetical protein


yjeT




PA4944


9951224


conserved hypothetical protein


hfq




PA4945 


9951225


delta 2-isopentenylpyrophosphate transferase miaA


PA4948


9951228


conserved hypothetical protein


yjeE




PA4952


9951233


conserved hypothetical protein


yjeQ




PA4956


9951237


thiosulfate sulfurtransferase


rhdA






I 1 iirT W W w J 

PA4961


9951243


hypothetical protein


1 1 < V ■ • 1 

PA4962


9951244


conserved hypothetical protein




y wwi 1 




9951246


topoisomerase IV subunit A








9951247 


hypothetical protein






1 rMfoDO ] 


9951248


hypothetical protein






PA4967


9951249 


topoisomerase IV subunit B 


parE 






9951252 


conserved hypothetical protein 




ice 


I PAZIQ79 


9951255 


hypothetical protein 






iPAiiQRn 


9951263 


probable enoyl-CoA hydratase/isomerase 






PA4988


9951272 


3-deoxy-D-manno-octulosonic-acid (KDO) trart 


waaA 




1 PAAQQH 


9951275 


SMR multidrug efflux transporter 






1 PAAQQi 


9951276 


hypothetical protein 






1 DA^OQO 


9951277


hypothetical protein 






PA4997




transport protein MsbA 


msbA 








conserved hypothetical protein






1 r AoUUD 




hypothetical protein 






j rAOUU ( 




hypothetical protein 




wapu ina/A 






hypothetical protein 




wapr waaA 


1 rAOUUo 




lipopolysaccharide core biosynthesis protein \A 




waaP


rran 


1 r AOUug7 
1 DAAHi n 


QQ«;i996 


UDP-glucose:(heptosyl) LPS alpha 1,3-glucos3 


waaG




j r AOU 1 U 


9951297 


heptosyltransferase I


waaC 


rfftP 


iDA'^ni'? 


9951298 


heptosyltransferase II


waaF 


TfeP 

1 idP 


j rAOU 1 z 

|pAc;noA 
1 n AOUZO 


9951317


conserved hypothetical protein 






lpA*\n*^9 


995132' 


probable transcriptional regulator 






1 PAi^n*^*^ 


9951322 


hypothetical protein 






ipAf^n'^z. 


9951323 



uroporphyrinogen decarboxylase 


hemE




1 PA J^'^Q 


9951329 



shikimate kinase 


aroK




PA5044


9951334


type 4 fimbrial biogenesis protein PilM


pilM




PA5050


9951341 


primosomal protein N'


priA 




PA5051 


9951342 


arginyl-tRNA synthetase 


argS 




PA5052


9951343 


hypothetical protein 






iPA'^OSI 


9951353


conserved hypothetical protein 




phal 




9951355 


ubiquinone biosynthesis methyltransferase Ub


ubiE 




PA5064


9951356 


) hypothetical protein 






IpAXsnRi; 


9951357 


' conserved hypothetical protein 




aarF yigR 


PA5067


9951 36C 


1 phosphoribosyl-ATP pyrophosphohydrolase 


hisE 




PA5068


9951361 


translocation protein TatA 


tatA 


mttA yigT 


PA5071 


995136^ 


1 conserved hypothetical protein 






PA5072 


9951365 


) probable chemotaxis transducer 






PA5081 


9951 37£ 


> hypothetical protein 






PA5085 


9951 37J 


3 probable transcriptional regulator 






PA5110 


995140( 


3 fructose-1,6-bisphosphatase


fbp 


cbbF, cftcF 


PA5111


9961 40( 


3 lactoylglutathione lyase


gloA3


9lo1 


PA5116


995141: 


3 probable transcriptional regulator




PA5119


995141" 


7 glutamine synthetase 


glnA
PA5128


9951427


secretion protein SecB secB


PA5129


9951428


glutaredoxin grx




PA5130 

PA5131


9951429

9951430


conserved hypothetical protein yibN

phosphoglycerate mutase pgm yibO


PA5132
PA5142


9951431
9951442


hypothetical protein

glutamine amidotransferase hisH1


PA5144


9951444


hypothetical protein


PA5148


9951448


conserved hypothetical protein


yggX


PA5154

PA5161

PA5162

PA5163

PA5164

PA5173

PA5176

PA5178 

PA5182 

PA5187 

PA5190 

PA5195 


9951455

9951463

9951464
9951465
9951466
9951476
9951479
9951481
9951486
9951491 
9951495 
9951500 


probable permease of ABC transporter
dTDP-D-glucose 4,6-dehydratase
dTDP-4-dehydrorhamnose reductase
glucose-1-phosphate thymidylyltransferase
dTDP-4-dehydrorhamnose 3,5-epimerase

carbamate kinase

conserved hypothetical protein 

conserved hypothetical protein 

hypothetical protein 

probable acyl-CoA dehydrogenase

probable nitroreductase

probable heat shock protein


rmlB
rmlD
rmlA
rmlC
arcC 


rfbB 
rfbD 
rfbA 
rfbC

vrfE 

yrfH 
visC 


PA5221 

PA5222 

PA5224 

PA5225 

PA5227 

PA5229 

PA5237 

PA5239 

PA5240 

PA5246 

PA5247 

PA5259 

PA5260 


9951529 
9951530 
9951532 
9951533 
9951535 
9951537 
9951546 
9951548 
9951549 
9951556 
9951557 
9951570 
9951571 


probable FAD-dependent monooxygenase
hypothetical protein

aminopeptidase P

hypothetical protein
conserved hypothetical protein

conserved hypothetical protein 

conserved hypothetical protein 
transcription termination factor Rho 
thioredoxin 

conserved hypothetical protein 

conserved hypothetical protein 

uroporphyrinogen-III synthetase

porphobilinogen deaminase


pepP 

rho 
trxA 

hemD 
hemC 


vqfE 
vigC 

vial 
vail 

popE 


PA5275 

PA5276 
PA5278 


9951588 
9951589 


conserved hypothetical protein 
lipopeptide LppL precursor 
diaminopimelate epimerase 


lppL
dapF 


lyaY 


PA5281 
PA5288 
PA5296 
PA5300 


9951594 
9951602 
9951611 
9951616 


probable hydrolase 

nitrogen regulatory protein P-II 2
ATP-dependent DNA helicase Rep
cytochrome c5


glnK

rep 

cycB 


i£!iS _ 


PA5303 
PA5316 
PA5319 


9951 61 S 
9951632 
9951 63e 


1 conserved hypothetical protein 

\ 50S ribosomal protein L28 
> DNA repair protein RadC 


rpmB 
radC 




PA5320 


9951637 


' DNA/pantothenate metabolism flavoprotein 


dfp 
dut




PA5321 
PA5325
PA5328 


9951 63E 
995164: 
9951 64( 


) deoxyuridine 5'-triphosphate nucleotidohydrol< 
) hypothetical protein 

5 probable cytochrome c (mono-heme type)






PA5330 
PA5331 
PA5333 
PA5334 
PA5335


995164( 
9951 64< 
995165: 
995165: 
995165^ 


3 hypothetical protein 

orotate phosphoribosyltransferase

2 conserved hypothetical protein 

3 ribonuclease PH 

4 conserved hypothetical protein 


pyrE

rph

i 


vice 
PA5336


9951656


guanylate kinase gmk


PA5339


9951658


conserved hypothetical protein


PA5347


9951667


hypothetical protein





PA5350


9951670


rubredoxin




PA5351


9951671


rubredoxin




PA5358


9951678


4-hydroxybenzoate-octaprenyl transferase


ubiA




PA5364


9951685


probable two-component response regulator





PA5381


9951704 


hypothetical protein






PA5385


9951709


hypothetical protein


i 




PA5390


9951714


probable peptidic bond hydrolase


PA5396


9951721


hypothetical protein


PA5403


9951729


probable transcriptional regulator


PA5404


9951730 


hypothetical protein 






PA5406


9951732 


hypothetical protein 






PA5408


9951734 


hypothetical protein 






PA5417


9951744 


sarcosine oxidase delta subunit 


soxD




PA5457


9951788 


hypothetical protein 






PA5460


9951792


hypothetical protein 






PA5465


9951797 


hypothetical protein 






PA5469 


9951801 


conserved hypothetical protein 






PA5470


9951802 


probable peptide chain release factor 




nrfM 

prrn 


PA5480 


9951813 


hypothetical protein 






PA5482 


9951816 


hypothetical protein 






PA5496 


9951831 


hypothetical protein 






PA5503 


9951839 


probable ATP-binding component of ABC tran


5 


. 


PA5526 


9951864 


hypothetical protein 






PA5529 


9951867 


probable sodium/proton antiporter 






PA5531 


9951869 


TonB protein 


tonB 




PA5533 


9951871 


hypothetical protein 






PA5534 


9951873 


hypothetical protein 






PA5543


9951882


hypothetical protein 






PA5549 


9951889 


glucosamine-fructose-6-phosphate aminotran 


glmS




PA5552 


9951892 


glucosamine-1-phosphate acetyltransferase/N


glmU


gcaD 


PA5553 


9951893 


ATP synthase epsilon chain 


atpC 


uncC papG 


PA5554 


9951894 


ATP synthase beta chain . 


atpD 


uncD papB 


PA5555 


9951895 


ATP synthase gamma chain 


atpG 


uncG papC 


PA5556 


9951896 


ATP synthase alpha chain 


atpA 


uncA papA 


PA5557 


9951897 


ATP synthase delta chain 


atpH 


uncH papE 


PA5558 


9951898 


ATP synthase B chain 


atpF 


uncF papF 


PA5559 


9951899 


ATP synthase C chain


atpE 


uncE papH 


PA5560 


9951900 


ATP synthase A chain 


atpB


uncB papD 


PA5561 


9951901 


ATP synthase protein I


atpI


uncI


PA5562 


9951903 


chromosome partitioning protein SpoOJ 


spoOJ 


1 


PA5566 


9951907 


' hypothetical protein 




j 


PA5568 


995190S 


) conserved hypothetical protein 




yidC 


PA5569


995191C 


1 ribonuclease P protein component 


rnpA




PA5570 


9951911 


50S ribosomal protein L34 


rpmH 


1 
DNA replication, recombination, modifica


483 


2027


DNA replication, recombination, modifica 


4275 


6695


Hypothetical, unclassified, unknown


8339


7803


Translation, post-translational modificatio


12488


10434


Translation, post-translational modificatio


13435


12488


Cell wall / LPS / capsule 


14235 


15122


Hypothetical, unclassified, unknown


17217


16900


Hypothetical, unclassified, unknown 


24001 


24558


Hypothetical, unclassified, unknown 


27646 


28632 


Hypothetical, unclassified, unknown 


36270 


35905 


Amino acid biosynthesis and metabolism 


37893 


37087 


Hypothetical, unclassified, unknown 


40190 


40405 


Hypothetical, unclassified, unknown 


40589 


40816 


Hypothetical, unclassified, unknown 


56546 


56941 


Hypothetical, unclassified, unknown 


61879 


62388 


Hypothetical, unclassified, unknown 


68616 


68188 


Hypothetical, unclassified, unknown 


69272 


69526 


Hypothetical, unclassified, unknown 


70091 


69543 


Hypothetical, unclassified, unknown 


70636 


70130 


Hypothetical, unclassified, unknown 


72680 


73384 


Adaptation, protection 


73468 


73923 


Hypothetical, unclassified, unknown 


74034 


74267 


Hypothetical, unclassified, unknown 


74716 


74279 


Hypothetical, unclassified, unknown 


78097 


77432 


Hypothetical, unclassified, unknown 


80752 


81027 


Hypothetical, unclassified, unknown 


81116 


82174 


Hypothetical, unclassified, unknown


83318 


82404 


Hypothetical, unclassified, unknown 


98218 


97754 


Hypothetical, unclassified, unknown 


100124 


101158 


Hypothetical, unclassified, unknown 


115045 


114611 


Hypothetical, unclassified, unknown 


119127 


120164 


Hypothetical, unclassified, unknown 


121346 


122266 


Energy metabolism 


127378 


128502


Hypothetical, unclassified, unknown 


131792 


131583 


Hypothetical, unclassified, unknown 


132577 


133155 


Energy metabolism 


134319 


135233 


Hypothetical, unclassified, unknown


135259 


135894 


Hypothetical, unclassified, unknown 


136386 


135934 


Hypothetical, unclassified, unknown 


136518 


136991 


Transport of small molecules 


138818 


140167 


Transcriptional regulators 


140216 


140902 


Hypothetical, unclassified, unknown 


143848 


143567 


Hypothetical, unclassified, unknown 


144072 


143845 


Hypothetical, unclassified, unknown 


145542 


145883 


Hypothetical, unclassified, unknown 


149425 


149138 


Transcriptional regulators 


150906 


151823 


Hypothetical, unclassified, unknown 


153696 


153836 


Adaptation, protection 


158199 


158762 


Nucleotide biosynthesis and metabolism 


163426 


164415 


Hypothetical, unclassified, unknown 


165737 


165219 


Transcriptional regulators 


169361 


169906 
Carbon compound catabolism


175503


176108


Transcriptional regulators


182768 


183706


Hypothetical, unclassified, unknown 


184287 


184439


Transcriptional regulators


191697


192362


Hypothetical, unclassified, unknown 


194179 


193799 


Hypothetical, unclassified, unknown


194748 


194206 


Putative enzymes


207071


207823


Carbon compound catabolism


209533 


207923


Transport of small molecules


210460


209621


Hypothetical, unclassified, unknown


213819 


214634 


Hypothetical, unclassified, unknown


214631 


215512 


Hypothetical, unclassified, unknown


229738 


229526 


Putative enzymes


232000 


230543 


Transport of small molecules


233100 


232066 


Transport of small molecules


233932 


233123 


Transport of small molecules


234849 


233929 


Transcriptional regulators


236218 


237111 


Hypothetical, unclassified, unknown


238896 


239777 




240071 


240934 


Hypothetical, unclassified, unknown


241753 


242445 


Transport of small molecules


243841


244605 


Transcriptional regulators


262557 


263498 


Transcriptional regulators


266616 


267395 


Hypothetical, unclassified, unknown


268704 


269519 


Transcriptional regulators


275772 


276440 


Hypothetical, unclassified, unknown


277334 


276480 


Amino acid biosynthesis and metabolism


277777 


277331 


Hypothetical, unclassified, unknown


282757 


282323 


Hypothetical, unclassified, unknown


283553 


282912 


Hypothetical, unclassified, unknown


289390 


289205 


Hypothetical, unclassified, unknown


293304


291154 


Hypothetical, unclassified, unknown


293798 


293301 


Hypothetical, unclassified, unknown


299497 


299081 


Transport of small molecules


309092 


307878


Transcriptional regulators


313227 


313925 


Transport of small molecules


314927 


313938 


Hypothetical, unclassified, unknown


318148 


317966 


Hypothetical, unclassified, unknown


350841 


350089 


Hypothetical, unclassified, unknown


352164 


351610 


Hypothetical, unclassified, unknown


359982


360332 


Energy metabolism


371833 


371162 


Hypothetical, unclassified, unknown


373725 


374192 


Hypothetical, unclassified, unknown


378096 


378575 


Hypothetical, unclassified, unknown


382792 


382037 


Fatty acid and phospholipid metabolism 


383727 


384527 


Nucleotide biosynthesis and metabolism 


384733 


385527 


Biosynthesis of cofactors, prosthetic grot 


393308


393814 


Hypothetical, unclassified, unknown 


402020 


402598 


Energy metabolism 


406498 


406247 


Central intermediary metabolism


407098 


406619 


Hypothetical, unclassified, unknown 


413654 


413364


Hypothetical, unclassified, unknown 


414529 


413933



i 
Protein secretion/export apparatus


417527 


418894 


Transcriptional regulators 


420683 


421537 


Hypothetical, unclassified, unknown 


421602 


422207 


Hypothetical, unclassified, unknown


423460 


423660 


Hypothetical, unclassified, unknown 


427120 


A Off ooo > 

426oD3j 


Hypothetical, unclassified, unknown


439991 


440395


Nucleotide biosynthesis and metabolism


445691


444687


Transcriptional regulators


446227


445715


Hypothetical, unclassified, unknown 


446773 


446339 


Hypothetical, unclassified, unknown 


447342 


446773 


Biosynthesis of cofactors, prosthetic grou


449384 


448431 


Motility & Attachment 


453239 


454114 


Transcriptional regulators 


463079 


463873 


Hypothetical, unclassified, unknown


470081 


470650 


Transport of small molecules 


476333 


477790 


Hypothetical, unclassified, unknown 


484404 


484838 


Hypothetical, unclassified, unknown 


496478 


496362 


Transport of small molecules 


496871 


498361 


Related to phage, transposon, or plasmid 


501120 


500104 


Hypothetical, unclassified, unknown 


502599 


501376 


Transcriptional regulators


504121


505029 


Hypothetical, unclassified, unknown 


510499 


509825 


Adaptation, protection 


514775 


514984 


Hypothetical, unclassified, unknown 


526877 


527179 


Hypothetical, unclassified, unknown 


535085


535489 


Transcriptional regulators 


535539 


536108 


Transcriptional regulators 


539143 


538217 


Transcriptional regulators 


540735


539786 


Hypothetical, unclassified, unknown 


549614 


549294 


Putative enzymes 


550381


549656


Hypothetical, unclassified, unknown 


550813 


550520 


Putative enzymes 


552746 


552994 


Hypothetical, unclassified, unknown 


558361 


557354 


Biosynthesis of cofactors, prosthetic grou 


560808 


562013 


Biosynthesis of cofactors, prosthetic grou


562006 


562728


Biosynthesis of cofactors, prosthetic grou 


562721 


563545 


Biosynthesis of cofactors, prosthetic grou


563549 


564235 


Hypothetical, unclassified, unknown 


564344 


564574 


Biosynthesis of cofactors, prosthetic grou


576040 


575516 


Transcriptional regulators 


586663 


585980 


Putative enzymes 


590105 


590821 


Transcriptional regulators 


594580 


595134 


Hypothetical, unclassified, unknown 


598608 


598994 


Hypothetical, unclassified, unknown 


600176 


599757 


Hypothetical, unclassified, unknown 


600426 


601394 


Hypothetical, unclassified, unknown 




OU 1 0^0 


Central intermediary metabolism 


604896 


603706 


Hypothetical, unclassified, unknown 


609999 


609202 


Energy metabolism 


611281 


612444 


Hypothetical, unclassified, unknown 


612517 


612717 


Carbon compound catabolism 


613338 


614402


Hypothetical, unclassified, unknown 


61754S 


616371
Hypothetical, unclassified, unknown 620488


620135


Hypothetical, unclassified, unknown


621695


622033


Hypothetical, unclassified, unknown


622726 


622884 


Hypothetical, unclassified, unknown


624199 


623852 


Hypothetical, unclassified, unknown


624803 


624189 


Hypothetical, unclassified, unknown


629884 


628763 


Hypothetical, unclassified, unknown


638830 


638381 


Translation, post-translational modificatio


639115



638900 


Translation, post-translational modificatio


639316



640341


Biosynthesis of cofactors, prosthetic grou


641073 



641426 


Hypothetical, unclassified, unknown


643714 



643208 


Hypothetical, unclassified, unknown


W*TW^WW 


648931 


Hypothetical, unclassified, unknown


R'^n'S'^8 


650158 


Biosynthesis of cofactors, prosthetic grou


UW^*T WW 


651497 


Chaperones & heat shock proteins


\JWO f / ^ 


652480 


Adaptation, protection 


OwWwil 1 


653753 


Energy metabolism 


OW«7^ 1 w 


670089 


Transcriptional regulators 


cyaoQI 


672777 


Transcriptional regulators 


U / 09W 1 


673191 


Hypothetical, unclassified, unknown 


D f *tDO i 


675026 


Hypothetical, unclassified, unknown 


Of Ow^U 


675839 


Related to phage, transposon, or plasmid


Of f UuO 


677409 


Hypothetical, unclassified, unknown 


OODOhD 


686718 


Hypothetical, unclassified, unknown 


00009W 


686899 


Related to phage, transposon, or plasmid 


OOOOUO 


688967 


Related to phage, transposon, or plasmid


009^00 


689466 


Related to phage, transposon, or plasmid 




690674 


Hypothetical, unclassified, unknown 


oyow^jo 


694366 


Hypothetical, unclassified, unknown 


OvO^w^ 


699720 


Hypothetical, unclassified, unknown 




700835 


Related to phage, transposon, or plasmid 


7nnft*^*^ 


701170 


Hypothetical, unclassified, unknown 


7n 14.77 

/ U l*T f / 


702529 


Related to phage, transposon, or plasmid 




702831 


Related to phage, transposon, or plasmid




703058 


Transcriptional regulators 


f U wV/ f 


706028 


Hypothetical, unclassified, unknown


706944 


707366 


Hypothetical, unclassified, unknown


709182



708535 


Hypothetical, unclassified, unknown


714247 


713279 


Hypothetical, unclassified, unknown


714686 



714264 


Hypothetical, unclassified, unknown


71723' 



717581 


Protein secretion/export apparatus


737530 



737081 


Hypothetical, unclassified, unknown


737677 



738108 


Protein secretion/export apparatus


738485 



738111 


Protein secretion/export apparatus 


741335 


741928 


Protein secretion/export apparatus 


744333 


745742 


Protein secretion/export apparatus 


745742 


746956 


Hypothetical, unclassified, unknown 


748662 


749774 


Hypothetical, unclassified, unknown 


770156 


770818 


Hypothetical, unclassified, unknown 


770847 


771326 


Hypothetical, unclassified, unknown 


77227£ 


772700


Hypothetical, unclassified, unknown 


775321 


774416 


Putative enzymes 


778181 


776787 
Putative enzymes


779208


778309


Transcriptional regulators


782113


781259


Hypothetical, unclassified, unknown 


782229 


782525 


Central intermediary metabolism


782570 


782965 


Hypothetical, unclassified, unknown


783833 


783576 


Hypothetical, unclassified, unknown


784698


785174


Hypothetical, unclassified, unknown


785969 


786925 


Hypothetical, unclassified, unknown 


786928 


788253 


Related to phage, transposon, or plasmid


789144 


789356 


Related to phage, transposon, or plasmid 


790166 


790600 


Related to phage, transposon, or plasmid


795793 


796776 


Hypothetical, unclassified, unknown 


797261 


797598 


Putative enzymes 


798827 


797925 


Transcription, RNA processing and degra


801967 


801275 


Hypothetical, unclassified, unknown 


802239 


801967 


Hypothetical, unclassified, unknown 


805228 


805473


Hypothetical, unclassified, unknown 


809882 


809574 


Hypothetical, unclassified, unknown 


828344 


827400 


Secreted Factors (toxins, enzymes, alginS 


831914


832498 


Protein secretion/export apparatus 


835523 


837322 


Protein secretion/export apparatus


837328


838182


Translation, post-translational modificatio


839407


840324 


Hypothetical, unclassified, unknown 


844295 


843723 


Hypothetical, unclassified, unknown 


845682 


845278 


Transport of small molecules 


858646


858951 


Hypothetical, unclassified, unknown 


859007 


860170 


Hypothetical, unclassified, unknown 


866451 


865636 


Hypothetical, unclassified, unknown 


881077 


881400 


Hypothetical, unclassified, unknown 


883216 


882989 


Hypothetical, unclassified, unknown 


885635 


886108 


Transcriptional regulators 


893041 


893994 


Hypothetical, unclassified, unknown 


895668 


895396


Hypothetical, unclassified, unknown 


896416 


897228 


Hypothetical, unclassified, unknown 


898886 


898440 


Hypothetical, unclassified, unknown 


900165 


899830 


Hypothetical, unclassified, unknown 


901046 


900408 


Putative enzymes 


903692 


904633 


Chaperones & heat shock proteins 


913086 


913571 


Hypothetical, unclassified, unknown 


929084 


929503 


Hypothetical, unclassified, unknown 


930476


929514 


Putative enzymes 


932725 


932102 


Cell division 


935989 


936294 


Hypothetical, unclassified, unknown 


942648 


943430 


Hypothetical, unclassified, unknown 


948776 


949159 


Hypothetical, unclassified, unknown 


949280 


949693 


Cell wall / LPS / capsule 


950648 


949716 


Amino acid biosynthesis and metabolism 


952514 


952158


Hypothetical, unclassified, unknown 


955722 


955456 


Putative enzymes 


96254£ 


962925


Hypothetical, unclassified, unknown 


977742 


977420


Hypothetical, unclassified, unknown 


98424£ 


984535


Translation, post-translational modificatk 


}1 98681 e 


989442 
Amino acid biosynthesis and metabolism


989590


990828


Transcriptional regulators


991013 


991198


Transcriptional regulators


992643 


991830


Hypothetical, unclassified, unknown


993409 


993783


Hypothetical, unclassified, unknown


993776 


994051


Transport of small molecules


996038 



997486 


Hypothetical, unclassified, unknown


1007548


1007234


Carbon compound catabolism


1012972 


1011983


Amino acid biosynthesis and metabolism


1020708 


1021607 


Hypothetical, unclassified, unknown


1027445


1027984


Hypothetical, unclassified, unknown


m'^msi 

1 wOU 1 w 1 


1029825 


Nucleotide biosynthesis and metabolism




1032095 


Nucleotide biosynthesis and metabolism




1032763 


Hypothetical, unclassified, unknown 




1035981 


Putative enzymes 




1040432 


Putative enzymes 




1040707 


Translation, post-translational modificatio


1 '104*^404 


1041689


Hypothetical, unclassified, unknown 


H n4R4R9 


1046671 


Adaptation, protection 


in4ftniQ 

1 U*tOU 1 17 


1047549 


Transport of small molecules 




1 0*54543 


Transport of small molecules 


I UOhODD 


10^500(5 


Transport of small molecules 




10^60*^9 


Transport of small molecules 




1 0*^7*^47 


Transport of small molecules 


•1 r\K7yinn 

lUOf *tUU 


1057906


Hypothetical, unclassified, unknown 




1060996 


Hypothetical, unclassified, unknown 


1UD4UOH 


1061907 


Hypothetical, unclassified, unknown 


1 U04ooc7 


1069061 


Hypothetical, unclassified, unknown


1 U04DU 1 


1069885 


Hypothetical, unclassified, unknown 


\ U04y^ 1 


1063544 


Hypothetical, unclassified, unknown 


lUDO I OO 


1065495 


Secreted Factors (toxins, enzymes, algini 


3 1 UD f O 1 f 


1066321 


Hypothetical, unclassified, unknown 


1 UOO 1 


1068456 



Hypothetical, unclassified, unknown 


inTiR77 
1 U r lOf 1 


1071239 


Hypothetical, unclassified, unknown 




1072839 



Chaperones & heat shock proteins 




1074673 


Hypothetical, unclassified, unknown




1083854 


Hypothetical, unclassified, unknown




1090857 


Adaptation, protection 




1092025 



Amino acid biosynthesis and metabolism




1094129 



Hypothetical, unclassified, unknown




1096034 


Nucleotide biosynthesis and metabolism


1096063



1096773 


Putative enzymes 


1107000 


1107761 


Hypothetical, unclassified, unknown




1112574 


Hypothetical, unclassified, unknown


112385Q 


1123366 


Hypothetical, unclassified, unknown


1125865 


1125548 


Hypothetical, unclassified, unknown 


1126336 


1125865


Biosynthesis of cofactors, prosthetic grou


1136386


1137035


Hypothetical, unclassified, unknown 


1144862 


1145200


Hypothetical, unclassified, unknown 


115047C 


1150721


Chaperones & heat shock proteins 


1153637 


1155547


Hypothetical, unclassified, unknown 


116366C 


1164022


Hypothetical, unclassified, unknown 


117561^ 


1176375 
Hypothetical, unclassified, unknown 1176380


1176982


Hypothetical, unclassified, unknown 1176958


1177620


Hypothetical, unclassified, unknown


1186606 


1186986


Two-component regulatory systems


1189172 


1190380 


Motility & Attachment


1194207 


1195223 


Motility & Attachment


1197390 


1197833


Hypothetical, unclassified, unknown


1198551 


1197838 


Hypothetical, unclassified, unknown


1199946 


1198750 


Hypothetical, unclassified, unknown


1207875 


1207540 


Hypothetical, unclassified, unknown


1212571 


1211888 


Hypothetical, unclassified, unknown


1214502 


1213195 


Translation, post-translational modificatio


1215284


1215727


Biosynthesis of cofactors, prosthetic grou


1218181 


1218933 


Antibiotic resistance and susceptibility


1221691 


1222098 


Hypothetical, unclassified, unknown


1226278


1225757


Hypothetical, unclassified, unknown


1227302


1226427 


Transcriptional regulators 


i99Q'^4 


1230219 


Transcriptional regulators 




1237546 


Hypothetical, unclassified, unknown 


19A*^1 1ft 
IZ*tO 1 1 o 


1242750 


Adaptation, protection 




1245928 


Hypothetical, unclassified, unknown 


IZ*toOO^ 


1249019 


Nucleotide biosynthesis and metabolism 




1249907 


Nucleotide biosynthesis and metabolism 




1251418 


Two-component regulatory systems 




1255752 


Adaptation, protection 


I^O (lie. 


1257981 


Hypothetical, unclassified, unknown 


19Rft47n 


1258087 


Amino acid biosynthesis and metabolism 




1259291 


Hypothetical, unclassified, unknown 




1264190 


Hypothetical, unclassified, unknown 




1264191 


Hypothetical, unclassified, unknown 


i9fifi11'' 


1266782 


Hypothetical, unclassified, unknown 


1267282 


1267602 


Energy metabolism


1272796


1272200


Energy metabolism 


1273298 


1272807 


Energy metabolism


1276608 


1276117 


Energy metabolism 


1276784 


1276617 


Transport of small molecules 


1284513 


1285823 


Hypothetical, unclassified, unknown 


1294138 


1294809


Hypothetical, unclassified, unknown 


1303457 


1303864 


Hypothetical, unclassified, unknown


1303892 


1304449 


Hypothetical, unclassified, unknown 1305583


1306056 


Hypothetical, unclassified, unknown 


1314321 


1313362 


Hypothetical, unclassified, unknown 


1317202 


1315916 


Hypothetical, unclassified, unknown 


1318150 


1317404 


Amino acid biosynthesis and metabolism 


1319514 


1318147 


Hypothetical, unclassified, unknown 


132100C 


1320386


Transcriptional regulators 


1326939 


1326046


Putative enzymes 


1327024 


1327803 


Hypothetical, unclassified, unknown 


1330467 


1330117 


Hypothetical, unclassified, unknown 


1331517 


1331714


Hypothetical, unclassified, unknown 


1334671 


1334928 


Antibiotic resistance and susceptibility 


1 1339082 


1337931 


Secreted Factors (toxins, enzymes, algin


1357317


1357712


Transcriptional regulators



1370092 1369418



Transcriptional regulators
Hypothetical, unclassified, unknown 


loyi^/ / 


109 1 OQ 1 ' 


Transcriptional regulators 




A OQ71 on 


Hypothetical, unclassified, unknown 




1 ^UO / 


Hypothetical, unclassified, unknown 


1408999 




Transcriptional regulators 


1409949 


AA^ t\A7fi 


Putative enzymes 


1417500 


141 /yoi 


Hypothetical, unclassified, unknown 


1417965 


AAA Q7QQ 

14 1 o/oo 


Transcriptional regulators 


1425754 



1425140 


Energy metabolism 


1432037 


A. A^0Q07 


Hypothetical, unclassified, unknown 



1435493 


A AOCQOtZ 

14o0ozo 


Transcriptional regulators 


1441547 


AAA rt^OO 

1440do9 


Hypothetical, unclassified, unknown 


1444451 



1442904 


Transport of small molecules 


1456723 


A A CLdQA C 

1455o1o 


Hypothetical, unclassified, unknown 


1462696 


1463403 


Hypothetical, unclassified, unknown


1463586 


1463936 


Hypothetical, unclassified, unknown 


1464111 


1464860 


Hypothetical, unclassified, unknown 


1467320 


1466109 


Hypothetical, unclassified, unknown 


1467901 


1467488 


Hypothetical, unclassified, unknown 


1468510


1468890 


Hypothetical, unclassified, unknown 


1470978 


1470580 


Hypothetical, unclassified, unknown 


1474391 


1474714 


Transcriptional regulators


1475464 


1476306 


Hypothetical, unclassified, unknown 


1479791 


1479021 


Hypothetical, unclassified, unknown 


1483123 


1483875 


Hypothetical, unclassified, unknown 


1483898 


1485763 


Hypothetical, unclassified, unknown 


1486967 


1486266 


Hypothetical, unclassified, unknown 


1489095 


1486960 


Biosynthesis of cofactors, prosthetic grou


1491913 


1493055 


Hypothetical, unclassified, unknown 


1494959 



1495492 


Hypothetical, unclassified, unknown 


1495635 


1495997 


Putative enzymes 


1496920 


1496087 


Hypothetical, unclassified, unknown 


1516433 


1516687 


Two-component regulatory systems 


1518914 


A CA QCAd 

1519o4o 


Hypothetical, unclassified, unknown 


1519627 



1519900 


Hypothetical, unclassified, unknown 


1526657 


1 52d430 


Carbon compound catabolism 


1533238 


1534270 


Hypothetical, unclassified, unknown 


1552641 


155299/ 


Hypothetical, unclassified, unknown 


1553112 


A CCOC7R 

1553D/0 


Transcriptional regulators 


1559122 


A ccooor\ 


Adaptation, protection 


1559254 


A ccoQfs:rt 
1009o09 


Hypothetical, unclassified, unknown 


157202O 


AC70RAA 


Motility & Attachment


1575290 


1575559 


Motility & Attachment 


1583956 


1584798 


Two-component regulatory systems 


1585640 


1586014 


Chemotaxis 


1591286 


1592176 


Cell division 


1592271 


1593059 


Chemotaxis 


1594087 


1594566 


Hypothetical, unclassified, unknown 


1594597 


1595004


Hypothetical, unclassified, unknown 


159688S 


1597305


Hypothetical, unclassified, unknown 


1599982


1599428


Transport of small molecules


1602179


1 o02odU 


Transport of small molecules 


1602877


1603o4o 


Transport of small molecules


1603671


1604429


Hypothetical, unclassified, unknown


1604426


1 o04oOZ 


Energy metabolism


1605088 


1607061 


Energy metabolism


1607065 


1607607 


Energy metabolism


1607604 


1608071 


Hypothetical, unclassified, unknown


1615908 



1614670 


Hypothetical, unclassified, unknown 


1617085 


1615895 


Hypothetical, unclassified, unknown 


1619907 


1620263 


Transport of small molecules 


1624715 


1623864 


Transcriptional regulators 


1634492 


1633842 


Hypothetical, unclassified, unknown 


1638639 


1638379 


Hypothetical, unclassified, unknown 


1639794 


1638652 


Hypothetical, unclassified, unknown 


1647046 


1646537 


Hypothetical, unclassified, unknown 


1649565 


1648629 


Hypothetical, unclassified, unknown 


1649928 


1650308 


Transcriptional regulators 


1660727 


1661386 


Cell division 


1665065 


1665934 


DNA replication, recombination, modifical 


1666025 


1668409 


DNA replication, recombination, modifical 


1669989 


1672034 


Hypothetical, unclassified, unknown 


1672080 


1672406 


Putative enzymes 


1673204 


1674352 


Hypothetical, unclassified, unknown 


1677559 


1678407 


Hypothetical, unclassified, unknown 


1678590 


1678261 


Transport of small molecules 


1678952 


1678584 


Hypothetical, unclassified, unknown 


1684962 


1684753 


Energy metabolism 


1689339 


1687924 


Energy metabolism 


1694045 


1693119 


Hypothetical, unclassified, unknown 


1696467 


1697099 


Hypothetical, unclassified, unknown 


1697188 


1697919 


Hypothetical, unclassified, unknown 


1697916 


1698344 


Hypothetical, unclassified, unknown 


1704522 


1704761 


Hypothetical, unclassified, unknown 


1709765 


1709412 


Hypothetical, unclassified, unknown 


1712698 


1712519 


Energy metabolism 


1720744 


1721130 


Energy metabolism 


1721124 



1721492 


Energy metabolism


1721496 


1723268 


Energy metabolism 


1723280 


1723987 


Energy metabolism 


1728416 


1729852 


Energy metabolism 


1730181 


1731347 


Energy metabolism 


1731347 


1732234 


Hypothetical, unclassified, unknown 


1734004 


1734768 


Hypothetical, unclassified, unknown 


1735063 


1734827 


Hypothetical, unclassified, unknown 


1735236 


1735709 


Hypothetical, unclassified, unknown 


1736706 


1736152


Hypothetical, unclassified, unknown 


17361 8S 


1 1737418 


Fatty acid and phospholipid metabolism 


175341E 


1752903 


Hypothetical, unclassified, unknown 


1762432 


1762870 


Transcriptional regulators


176292E 


1763762


Putative enzymes 


1765345 


1766205


Hypothetical, unclassified, unknown 


1766285 1766947


Hypothetical, unclassified, unknown 1766956


1767762


Transcriptional regulators 1773682


1774548 


Transport of small molecules 1775945 


1776034 


Transport of small molecules 1779877


1780428 


Hypothetical, unclassified, unknown 1784108


1785016 


Hypothetical, unclassified, unknown 1786888


1787166 


Hypothetical, unclassified, unknown 1790879


1790472


Hypothetical, unclassified, unknown 1805218


1805724 


Hypothetical, unclassified, unknown 1814995


1815135 


Hypothetical, unclassified, unknown


1816352 


1816858 


Hypothetical, unclassified, unknown


1824969 


1825430 


Biosynthesis of cofactors, prosthetic grou


1826040


1825495 


Hypothetical, unclassified, unknown


1826675 


1826118 


Hypothetical, unclassified, unknown


1827052 


1826732 


Amino acid biosynthesis and metabolism


1836367 


1837227 


Protein secretion/export apparatus


1841517 


1840468 


Protein secretion/export apparatus 




1841514 


Protein secretion/export apparatus




1845241


Protein secretion/export apparatus


1847227 


1848093 


Hypothetical, unclassified, unknown


1848707


1849072 


Hypothetical, unclassified, unknown


1849077


1849406 


Protein secretion/export apparatus


1851982 


1852278 


Protein secretion/export apparatus


1856862 



1856299 


Hypothetical, unclassified, unknown


1856308 



1856553 


Protein secretion/export apparatus


1 8B2558 


1862761 


Protein secretion/export apparatus


1862764 


1863021 


Protein secretion/export apparatus


1863024


1863371 


Protein secretion/export apparatus


1863799 


1864137 


Hypothetical, unclassified, unknown


1874967 


1875767 


Hypothetical, unclassified, unknown


1875849 


1876580 


Hypothetical, unclassified, unknown


1887297 


1887058 


Hypothetical, unclassified, unknown


1888186 


1887698 


Hypothetical, unclassified, unknown


1889173 


1888985 


Amino acid biosynthesis and metabolism


1891815 


1890739 


Amino acid biosynthesis and metabolism


1896630 


1897247 


Hypothetical, unclassified, unknown


1912756 


1912217 


Biosynthesis of cofactors, prosthetic grou 


1917599 


1918087 


Transport of small molecules 


1921174 


1922226 


Central intermediary metabolism 


1926123 


1925797 


Transport of small molecules 


1931775 


1930564 


Hypothetical, unclassified, unknown 


1933632 


1933054 


Energy metabolism 


1937644 


1935035 


Hypothetical, unclassified, unknown 


1939583 


1940206 


Hypothetical, unclassified, unknown 


1942442 


1941720 


Translation, post-translational modificatic 


1943067


1944737 


Translation, post-translational modificatic 


1944747


1946129 


Biosynthesis of cofactors, prosthetic groi 


1 1947041 


1946187 


Translation, post-translational modificatic 


1956227


1958623 


DNA replication, recombination, modifica 


1972959


1973405 


Nucleotide biosynthesis and metabolism 


1973470 


1974210 


Hypothetical, unclassified, unknown


1974627 


1974238 


Transport of small molecules 


1979941 


1978439


Hypothetical, unclassified, unknown


1984389 


1985033 


Hypothetical, unclassified, unknown


1989423 


1989109 


Hypothetical, unclassified, unknown 


1993898 


1993461 


Hypothetical, unclassified, unknown 


1995164 


1994667 


Hypothetical, unclassified, unknown 


1998587 


1998949 


Hypothetical, unclassified, unknown 


1999874 


1999512 


Hypothetical, unclassified, unknown 


2004892 


2004374 


Hypothetical, unclassified, unknown 


2007665 


2008249


Hypothetical, unclassified, unknown 


2012530 


2012255 


Hypothetical, unclassified, unknown 


2015136 


2014915 


Transcriptional regulators 


2019690 


2018803 


Transport of small molecules 


2022398 


2021712 


Hypothetical, unclassified, unknown 


2028454 


2028981 


Fatty acid and phospholipid metabolism 


2031466 


2031705 


Hypothetical, unclassified, unknown 


2034857 


2034066 


Transport of small molecules 


2052941 


2053264 


Energy metabolism 


2053277 


2053675 


Transcriptional regulators 


2054223 


2053672 


Hypothetical, unclassified, unknown 


2054309 


2054842 


Hypothetical, unclassified, unknown 


2062401 


2061664 


Hypothetical, unclassified, unknown 


2065545 


2064853 


Hypothetical, unclassified, unknown


2066767 


2065493 


Hypothetical, unclassified, unknown 


2067955 


2066786 


Hypothetical, unclassified, unknown


2068728 


2067961 


Secreted Factors (toxins, enzymes, algin


a 2070685 


2071173 


Secreted Factors (toxins, enzymes, algins 


2071209


2071697 


Secreted Factors (toxins, enzymes, algin


2076311


2076958 


Transcriptional regulators


2085426 


2084476 


Transcriptional regulators 


2085929 


2085423 


Hypothetical, unclassified, unknown 


2088034 


2086808 


Hypothetical, unclassified, unknown


2091837 


2091490 


Hypothetical, unclassified, unknown 


2103294 


2103770 


Hypothetical, unclassified, unknown 


2103770 


2104096 


Translation, post-translational modificatio 


1 2109511 


2108942 


Hypothetical, unclassified, unknown 


2109854 


2109558 


Hypothetical, unclassified, unknown 


2117897 


2118097 


Hypothetical, unclassified, unknown 


2118585 


2118893 


Hypothetical, unclassified, unknown 


2118926 


2119747 


Hypothetical, unclassified, unknown 


2122222 


2120225 


Hypothetical, unclassified, unknown


2137785 


2136520 


Hypothetical, unclassified, unknown 


2138598 


2137846 


Hypothetical, unclassified, unknown 


2141002 


2140433 


Hypothetical, unclassified, unknown 


2141487 


2140999 


Hypothetical, unclassified, unknown 


2145894 


2146502 


Hypothetical, unclassified, unknown 


2146609 


2146875 


Hypothetical, unclassified, unknown 


2148854 


2149189 


Hypothetical, unclassified, unknown 


2150364 


2149864 


Hypothetical, unclassified, unknown 


2150524 


2150781 


Hypothetical, unclassified, unknown 


2157968 


2159167 


Transcriptional regulators 


2164648 


2163883 


Two-component regulatory systems 


2165876 


2166553 


Biosynthesis of cofactors, prosthetic grou


2171865


2171936


Biosynthesis of cofactors, prosthetic grou 2171989


2172903


Biosynthesis of cofactors, prosthetic grou 2173662


2173940


Hypothetical, unclassified, unknown 2181744


2181181


Hypothetical, unclassified, unknown


2182097 


2181741


Chaperones & heat shock proteins


2182394 


2182116


Fatty acid and phospholipid metabolism


2187065 


2188246


Hypothetical, unclassified, unknown


2188459 


2189883


Carbon compound catabolism


2196132 


2195494


Transcriptional regulators


2198891 


2199694 


Putative enzymes 


2203446 


2202649 


Fatty acid and phospholipid metabolism


9906353 


2205190 


Transcriptional regulators 


9906806 


2206402 


Hypothetical, unclassified, unknown


990fiQQ9 


2207928 


Hypothetical, unclassified, unknown


991*^1*539 


2213315 


Hypothetical, unclassified, unknown 


991 onoQ 


2218098 


Hypothetical, unclassified, unknown 


099097*^ 


2220574 


Hypothetical, unclassified, unknown 


idiLiC 1 1 u / 


2220903 


Hypothetical, unclassified, unknown 




2224478 


Hypothetical, unclassified, unknown 


/ O** 1 


2229001 


Transport of small molecules 




2235309 


Transcriptional regulators 




2245948 


Central intermediary metabolism 




2245986 


Transport of small molecules 




2257720 


Putative enzymes 


ZZOoA / 0 


2260659 


Hypothetical, unclassified, unknown 


ZZOOf 


2265126


Translation, post-translational modificatio 


1 ZZ I Z^DU 


2274568 


Transport of small molecules 


OOT7i^l^9 
ZZ/ /OOZ 


2278982 


Hypothetical, unclassified, unknown 


007QQQO 

zZf pyoz 


2279794 


Hypothetical, unclassified, unknown 


000-1 C7Q 
ZZo 1 0/ 0 


2279917 


Hypothetical, unclassified, unknown 




2289085 


Hypothetical, unclassified, unknown 


ZZ9 / Uf z 


2297920 


Hypothetical, unclassified, unknown 


ZOUUW / w 


2301755 


Transport of small molecules 


9303022 


2304215 


Hypothetical, unclassified, unknown


9306697 


2305782 


Carbon compound catabolism


9*^070*57 


2309432 


Hypothetical, unclassified, unknown


9319899 


2313789 


Biosynthesis of cofactors, prosthetic grou


9314512 


2315690 


Putative enzymes 


9316708 


2317403 


Hypothetical, unclassified, unknown


9318624 


2318226 


Hypothetical, unclassified, unknown


9322071 


2321130 


DNA replication, recombination, modifica


1 2329348 


2330424 


Putative enzymes


2332059 


2330959 


Hypothetical, unclassified, unknown 


2332820 


2332392 


Transcriptional regulators 


2335172 


2336104 


Hypothetical, unclassified, unknown 


2339987 


2339352 


Hypothetical, unclassified, unknown 


2346477 


2347838 


Hypothetical, unclassified, unknown 


235243C 


1 2351906 


Putative enzymes 


2356712 


\ IZblblZ 


Hypothetical, unclassified, unknown 


2358024 


2358311


Hypothetical, unclassified, unknown 


236170e 


> 2361873 


Hypothetical, unclassified, unknown 


2364816 


1 2365058 


Hypothetical, unclassified, unknown 


237747C 


2376538


Hypothetical, unclassified, unknown 2381778


2381473


Hypothetical, unclassified, unknown 2390256


2390620


Hypothetical, unclassified, unknown 2390949


2392046 


Hypothetical, unclassified, unknown


2393424 


2393633 


Hypothetical, unclassified, unknown 


2393708 


2394178


Hypothetical, unclassified, unknown


2396252 


2395944 


Hypothetical, unclassified, unknown


2396883 


2396536 


Hypothetical, unclassified, unknown


2404656 


2404386 


Hypothetical, unclassified, unknown


2405233 


2404949 


Hypothetical, unclassified, unknown


2405739 


2405230 


Hypothetical, unclassified, unknown


2405993 


2406877 


Hypothetical, unclassified, unknown


2406961 


2407131 


Hypothetical, unclassified, unknown


2407236 


2407661 


Hypothetical, unclassified, unknown


2409837 


2410181 


Hypothetical, unclassified, unknown


2411709 


2412122 


Transcriptional regulators


2416661 


2416245 


Hypothetical, unclassified, unknown


2416376 


2417413 


Hypothetical, unclassified, unknown


2424400 


2423924 


Hypothetical, unclassified, unknown


2427017 


2425497 


Hypothetical, unclassified, unknown


2430170 


2431129 


Transport of small molecules


2433748


2435070 


Transport of small molecules


2441555 


2440347 


Transcriptional regulators 


9441771 


2442691 


Hypothetical, unclassified, unknown


2443161 



2444366 


Hypothetical, unclassified, unknown


2446533 


2444886 


Hypothetical, unclassified, unknown


2446564 


2445545 


Hypothetical, unclassified, unknown


2447325 


2446597 


Hypothetical, unclassified, unknown


2447989 


2447573 


Hypothetical, unclassified, unknown


2448533 


2448033 


Transcriptional regulators


2449545 


2448568 


Hypothetical, unclassified, unknown


2450765 


2449554 


Hypothetical, unclassified, unknown


2451707 


2452426 


Transport of small molecules


2457510 


2458280 


Hypothetical, unclassified, unknown


2468032 


2469099


Hypothetical, unclassified, unknown


2472104


2472409 


Hypothetical, unclassified, unknown


2479130


2478312 


Amino acid biosynthesis and metabolism


2480844 


2481830 


Secreted Factors (toxins, enzymes, algin


a 2485471 


2486118 


Transcriptional regulators


2487293 


2486355 


Hypothetical, unclassified, unknown


2488950 


2489732 


Hypothetical, unclassified, unknown


2508775 


2509467 


Hypothetical, unclassified, unknown 


2510622 


2511050 


Hypothetical, unclassified, unknown 


2512511 


2513182 


Hypothetical, unclassified, unknown 


2523239 


2522958 


Hypothetical, unclassified, unknown 


2524201 


2523236 


Transport of small molecules 


2525052 


2524198 


Transport of small molecules 


2525846 


> 2525049 


Energy metabolism 


2527657 


2527412


Putative enzymes 


2529467 


2527743


Hypothetical, unclassified, unknown 


2540082 


2539063


Hypothetical, unclassified, unknown 


254974£ 


2549906 


Transcriptional regulators 


255396!


2554858


Transport of small molecules


2571691


2570855


Hypothetical, unclassified, unknown 


2573365 


2572805


Hypothetical, unclassified, unknown


2579542 


2580882 


Transport of small molecules 


2582097 


2583407


Carbon compound catabolism 


2587906 


2589414 


Hypothetical, unclassified, unknown 


2593330 


2594547 


Hypothetical, unclassified, unknown 


2595985 


2596779 


Transport of small molecules


2597869 


2598522


Hypothetical, unclassified, unknown 


2614893 


2615438


Hypothetical, unclassified, unknown 


2617019 


2617516 


Hypothetical, unclassified, unknown 


2617529 


2617954 


Hypothetical, unclassified, unknown 


2619695 


2620711 


Hypothetical, unclassified, unknown 


2623284 


2623856 


Hypothetical, unclassified, unknown 


2627175 


2626780 


Transcriptional regulators 


2635971 


2635051 


Transport of small molecules


2645303 


2646727 


Hypothetical, unclassified, unknown 


2689240 


2689569 


Hypothetical, unclassified, unknown 


2689566 


2690126 


Putative enzymes 


2694544 


2693780 


Hypothetical, unclassified, unknown 


2694763 


2694545 


Hypothetical, unclassified, unknown 


2701205 


2702065 


Hypothetical, unclassified, unknown


2705772 


2706086 


Hypothetical, unclassified, unknown 


2723221 


2722754 


Hypothetical, unclassified, unknown 


2724222 


2723308 


Hypothetical, unclassified, unknown 


2724484 


2724729 


Hypothetical, unclassified, unknown 


2730520 


2729975 


Hypothetical, unclassified, unknown 


2732965 


2732531 


Hypothetical, unclassified, unknown 


2738840 


2739715 


Amino acid biosynthesis and metabolism 


2740882 


2739761 


Amino acid biosynthesis and metabolism 


2747072 


2746689 


Hypothetical, unclassified, unknown 


2753464 


2752865 


Hypothetical, unclassified, unknown 


2754601 


2754822 


Hypothetical, unclassified, unknown 


2755724 


2756251 


Hypothetical, unclassified, unknown 


2756308 


2756649 


Hypothetical, unclassified, unknown 


2760086 


2759481 


Hypothetical, unclassified, unknown 


2760618 


2760322 


Hypothetical, unclassified, unknown 


2761350 


2760871 


Hypothetical, unclassified, unknown 


2781450 


2780926 


Transcriptional regulators 


2786355 


2785369 


Transcriptional regulators 


2787885 


2786971 


Central intermediary metabolism 


2791219 


2791863 


Hypothetical, unclassified, unknown 


2791905 


2792816 


Putative enzymes 


2794132 


2792798 


Hypothetical, unclassified, unknown 


2803344 


2803622 


Hypothetical, unclassified, unknown


2804131 


2803859 


Hypothetical, unclassified, unknown 


2805916 


1 2806290 


Putative enzymes 


280736fi 


1 2806349 


Transcriptional regulators 


280746S 


1 2808511 


Hypothetical, unclassified, unknown 


2815281 


2814766 


Transport of small molecules 


2817448 


2818674 


Hypothetical, unclassified, unknown • 281888£ 


2818718 


Hypothetical, unclassified, unknown 2822321 


2821704


Carbon compound catabolism



2825590 



2824658



Carbon compound catabolism 



2833129 



2832368 



Carbon compound catabolism



2834689 



2834201 



Transport of small molecules



2841965 



2840511



Fatty acid and phospholipid metabolism



2865104 



2864169 



Hypothetical, unclassified, unknown



2866199 



2865747 



Hypothetical, unclassified, unknown 



2867505 



2866192 
2875696 



Hypothetical, unclassified, unknown 



2876409 



Hypothetical, unclassified, unknown



2880519 



2881562



Transcriptional regulators
Putative enzymes



2883156 



2884088 



2885332 



2884205 



Putative enzymes 
Putative enzymes



2886551 



2885361 



2887336 



2886569 



Transcriptional regulators



2913921 



2914358 



Fatty acid and phospholipid metabolism 



2923926 



2923366 



Transcriptional regulators



2934387 



2933581 



Hypothetical, unclassified, unknown 
Hypothetical, unclassified, unknown



2945263 



2945868 



2948581 



Hypothetical, unclassified, unknown



2948973 



2948976 
2949332 



Hypothetical, unclassified, unknown



2949332 



2949637 



Hypothetical, unclassified, unknown
Translation, post-translational modificatio



2949634



2949969 



2954695 



2953415 



Chaperones & heat shock proteins 



2956778 



2956152 



Cell division

Translation, post-translational modificatio



2959239 



2956804 



2960455 



2961135



Translation, post-translational modificatio



2962002 



2962220 



Hypothetical, unclassified, unknown



2964842 



2964606 



Transcription, RNA processing and degr



2969986 



2971113 



Nucleotide biosynthesis and metabolism 



2972698 



Energy metabolism



2983204 



2974068 
2983881 



Energy metabolism



2986242 



2987688 



Energy metabolism



2992001 



2992501 



Energy metabolism



2992547 



2992855 



Hypothetical, unclassified, unknown 



3008323 



3008009 



Hypothetical, unclassified, unknown



3012536 



3012279 



Biosynthesis of cofactors, prosthetic grou



301558 



3015937 



Hypothetical, unclassified, unknown



3016245 



3016598 



Hypothetical, unclassified, unknown 



3016883 



3016674 



Protein secretion/export apparatus



3020928 



3020503 



Protein secretion/export apparatus



3021338 



3020928 



Protein secretion/export apparatus



3021741 



3021307 



Transport of small molecules 



3025507 



3024704 



Transcriptional regulators

Amino acid biosynthesis and metabolism 



3028074 



3029003 



3031397 



3030435 



Hypothetical, unclassified, unknown



3042501 



3043415 



Related to phage, transposon, or plasmid
Energy metabolism 



3044765 



3043749 



3047969 



3047643 



Hypothetical, unclassified, unknown 



305061 



3050330 



Hypothetical, unclassified, unknown 



3057375 



3057608 



Hypothetical, unclassified, unknown



3060659 



3060264 



Energy metabolism



3070365 



3070703 
3074417 



Hypothetical, unclassified, unknown 



307373


Hypothetical, unclassified, unknown 3075217 3074579



Hypothetical, unclassified, unknown 3075410


3075oo9i 


Hypothetical, unclassified, unknown 3075921


3076313


Hypothetical, unclassified, unknown 


3076343 


307do21 


Central intermediary metabolism 


3079196 


3079834 


Hypothetical, unclassified, unknown 


3089612 


3088659 


Hypothetical, unclassified, unknown 


3090107


3089643 


Hypothetical, unclassified, unknown


3094184


3093657 


Hypothetical, unclassified, unknown 


3096051 


3094756 


Hypothetical, unclassified, unknown 


3098978 


3098625 


Hypothetical, unclassified, unknown 


3099729 


3099472 


DNA replication, recombination, modificat 


3100111 


3099809 


Translation, post-translational modificatio 


1 3102493 


3100115 


Translation, post-translational modificatio 


3103544


3102528 


Translation, post-translational modificatio 


1 3103998 


3103642 


Translation, post-translational modificatio 


T 3104216 


3104022 


Translation, post-translational modificatio 


1 3104829 


3104278 


Translation, post-translational modificatio


3106751


3104829 


DNA replication, recombination, modifica


3110900 


3111613


Hypothetical, unclassified, unknown


3114816 


3115192 


Hypothetical, unclassified, unknown


3117610 


3117176 


Hypothetical, unclassified, unknown


3119707 


3119393 


Hypothetical, unclassified, unknown


3122264 


3121920 


Hypothetical, unclassified, unknown


3122585 


3122376 


Putative enzymes


3126251 


3127219 


Hypothetical, unclassified, unknown


3127225 


3127707 


Hypothetical, unclassified, unknown


3128269 


3127859 


Hypothetical, unclassified, unknown


3132815 


3132228 


Hypothetical, unclassified, unknown 


3133257 


3132820 


Hypothetical, unclassified, unknown 


3137849 


3138193 


Hypothetical, unclassified, unknown 


3138190 


3138531 


Hypothetical, unclassified, unknown 


3139010 


3139669 


Hypothetical, unclassified, unknown 


3141718 


3142275 


Hypothetical, unclassified, unknown 


3142284 


3142502 


Hypothetical, unclassified, unknown 


3142617 


3143084 


Hypothetical, unclassified, unknown 


3149318 


3148722 


Hypothetical, unclassified, unknown 


3152201 


3150885 


Hypothetical, unclassified, unknown 


3155074 


3154592 


Hypothetical, unclassified, unknown 


3156502 


3156801 


Hypothetical, unclassified, unknown 


3157563 


3156859 


Hypothetical, unclassified, unknown 


3158925 


3159668 


Hypothetical, unclassified, unknown 


3160320 


3160583 


Hypothetical, unclassified, unknown 


3162215 


3161598 


Hypothetical, unclassified, unknown 


3162578 


3162387 


Transport of small molecules 


3165541 




Hypothetical, unclassified, unknown 


3173174 


3171597 


Hypothetical, unclassified, unknown 


3173243 


3173722 


Hypothetical, unclassified, unknown 


3180951 


3180553 


Hypothetical, unclassified, unknown 


3182399 


3182851 


Hypothetical, unclassified, unknown 


3184001 


3185128 


Adaptation, protection 


3185816 


3185160 


Hypothetical, unclassified, unknown 


3192748


3193518


Putative enzymes


3197641 


3198987


Hypothetical, unclassified, unknown 


3200552 


3200319


Translation, post-translational modificatior 


1 3205079 


3204513 


Hypothetical, unclassified, unknown 


3206252 


3205122 


Membrane proteins 


3206914 


3207165 


Hypothetical, unclassified, unknown


3208260 


3207289 


Transcription, RNA processing and degr 3212524


3213030 


Secreted Factors (toxins, enzymes, algin 3215422


3216288 


Hypothetical, unclassified, unknown 




3221564 


Hypothetical, unclassified, unknown 


OZZOOoO 


3227383 


Nucleotide biosynthesis and metabolism 


oZoUT Ol 


3229483 


Transcriptional regulators 


OZO 1 1/1 


3230278 


Transcriptional regulators 


od,oZi It 


3231881 


Hypothetical, unclassified, unknown 


32o0.79o 


3235960 


Hypothetical, unclassified, unknown 


3249630 


3249208 


Hypothetical, unclassified, unknown 


3253679 


3253311


Hypothetical, unclassified, unknown


3255765


3255406 


Hypothetical, unclassified, unknown 


3265210 


3264641 


Hypothetical, unclassified, unknown 


3271692 


3270826 


Hypothetical, unclassified, unknown 


3272405 


3271812 


Putative enzymes 


3277408 


3278577 


Hypothetical, unclassified, unknown 


3284597 


3283374 


Hypothetical, unclassified, unknown 


3291172 


3290693 


Hypothetical, unclassified, unknown 


3291282 


3291863 


Hypothetical, unclassified, unknown


3291959 


3292267 


Putative enzymes 


3297117


3295978 


Putative enzymes 


3308390 


3309337 


Energy metabolism 


3311720


3310791 


Energy metabolism 


00*1 *^ Add 

3o124Dy 


3311720 


Energy metabolism 


3312790 


3314445 


Motility & Attachment 


ooZUUz9 


3319673 


DNA replication, recombination, modifical 




3320063 


Nucleotide biosynthesis and metabolism


OOZ ID # 


3321042 


Hypothetical, unclassified, unknown 


00^£. t sz 


3321703 


Fatty acid and phospholipid metabolism


wOZw lO^ 


3324946 


Fatty acid and phospholipid metabolism


OOZ.U 1^ 1 


3325378 


Fatty acid and phospholipid metabolism


3327082 


3326144 


Translation, post-translational modificatio 


V 3328384 


3328202 


Hypothetical, unclassified, unknown 


3328934 


3328398 


Transcription, RNA processing and degr


3332303


3331347 


Cell wall / LPS / capsule 


3337230 


3336211 


Translation, post-translational modificatic 


1 3337691 


3337227 


Cell wall / LPS / capsule


3338455 


3337691 


Hypothetical, unclassified, unknown 


3338640 


3338455 


Cell wall / LPS / capsule 


3339676 


3338678 


Hypothetical, unclassified, unknown 


3340116 


3339676 


Transport of small molecules 


3340748 


3340113 


Hypothetical, unclassified, unknown 


3343177 


3343710 


Hypothetical, unclassified, unknown 


3345099 


3343798 


Transport of small molecules


3345795 


3345112 


Hypothetical, unclassified, unknown 


3347038 


3345788 


Hypothetical, unclassified, unknown


3346976 


3347740


Nucleotide biosynthesis and metabolism


3350231


3348837


Hypothetical, unclassified, unknown




3350410

WWWW r 1 w • 




3354171 


3353497 




3360653 



3359268 



Nucleotide biosynthesis and metabolism


3365743 


3365006 


Hypothetical, unclassified, unknown


3369968


3369035 


DNA replication, recombination, modificat


3372705 


3370099 



Hypothetical, unclassified, unknown


3373170 



3372796 



Hypothetical, unclassified, unknown


3'^7A'^i i 


3378948 



Hypothetical, unclassified, unknown


OOOOa^/ 


3334333 


Hypothetical, unclassified, unknown


oODO 1 OO 


3334377 


Putative enzymes


OQQ7QOQ 


3336269 


Biosynthesis of cofactors, prosthetic grou




OOi? •tUOw 


Hypothetical, unclassified, unknown


OOljl o / o 


00«7 1 Uww 


Hypothetical, unclassified, unknown


0090/ lO 


33QQ643 


Hypothetical, unclassified, unknown


o*tUoO 1 1 


0*T w*r 1 *tW 


Hypothetical, unclassified, unknown


04U4T44 


0*m*TW^*T


Hypothetical, unclassified, unknown


o4U4ooo 


O^U*rOD 1


Hypothetical, unclassified, unknown


o4U^90Z 


OyinOfiOR 

o^uyowo 


Translation, post-translational modificatio


3414400


0414D 1^ 


Hypothetical, unclassified, unknown


3416000 


•o4 lO# oD 


Transcriptional regulators 


O A Ofi A^n 

3436429 


0400oOD 


Hypothetical, unclassified, unknown


3457909 


o40D044i 


Hypothetical, unclassified, unknown


3464151 


o40oooo 


Hypothetical, unclassified, unknown


3466959 


o>icftn70 
o4doU/^ 


Hypothetical, unclassified, unknown


346708* 


O4D0vH?3 


Hypothetical, unclassified, unknown


347301 / 


04 / *f 1 OO 


Protein secretion/export apparatus


3476479 


04/ oyoo 


Protein secretion/export apparatus


o477Dii9 


04 / D**0 1 


Protein secretion/export apparatus


o4o0zoo 


"^470790 
0*l / 57 / itO 


Protein secretion/export apparatus


04004ZJ 


0*rO 1 w 1 0 


Hypothetical, unclassified, unknown


o4yi4TU 


'^4Qn7'^i 

O'txJU # O 1 


Fatty acid and phospholipid metabolism


o49oO 1 Z 


'^AQ97nn 


Amino acid biosynthesis and metabolism


O0UU09 / 


0*tw%?*tOO 


Hypothetical, unclassified, unknown


Qcnfto^'l 

oOUDZ*f 1 


3*^0^364 

OwwOOU^ 


Hypothetical, unclassified, unknown


OOZ44oy 


3*^94160 


Hypothetical, unclassified, unknown


OO^ / / OO 




Hypothetical, unclassified, unknown


OOZ0049 


3*^93930 

OWt^O^ww 


Cell wall / LPS / capsule


oozy4**D 


3'>98497 

Ow4^0*t^ « 


Cell wall / LPS / capsule


qcon4'^7 

ooou**o# 


3599507 


Cell wall / LPS / capsule


OOO 1 / U/ 


3530466 


Cell wall / LPS / capsule


OOO^O 1*1 


3531750 

www 1 1 wW 


Cell wall / LPS / capsule


3^33Q39 


3532811

WwWbW ■ ■ 


Cell wall / LPS / capsule


3535080 


3533947 


Amino acid biosynthesis and metabolism


3535970


3535215 


Amino acid biosynthesis and metabolism 


3536578


3535970 


Cell wall / LPS / capsule 


353781 C 


3536575


Cell wall / LPS / capsule 


3539123 


3537807


Cell wall / LPS / capsule 


354020€ 


3539127


Cell wall / LPS / capsule 


354078^ 


3540209


Cell wall / LPS / capsule 


354267C 


3540781


Cell wall / LPS / capsule 


[ 354368S 


3542739
Cell wall / LPS / capsule


3545073


3543763


Cell wall / LPS / capsule


3546926 


3545880


DNA replication, recombination, modificat


3547972


3547688


Translation, post-translational modificatio


3549788 


3548109


Nucleotide biosynthesis and metabolism


3550745 


3550056 


Biosynthesis of cofactors, prosthetic grou


3556338 


3555253


DNA replication, recombination, modificat


3559197 


3556426 


Biosynthesis of cofactors, prosthetic grou


3562105 


3562803 


Hypothetical, unclassified, unknown


3569012 


3568635 




3571602 


3570940 


Hypothetical, unclassified, unknown


3575696 


3574845 


Carbon compound catabolism


3587432 


3588436 


Hypothetical, unclassified, unknown


3593732 


3594031 


Hypothetical, unclassified, unknown


3594207 


3594569 


Hypothetical, unclassified, unknown


3597318 


3597797 


Transport of small molecules


3600453 


3601598 


Transcriptional regulators


3609075 


3609839 


Chaperones & heat shock proteins


3614303 



3614866 


Hypothetical, unclassified, unknown


3618466 


3617342 


Putative enzymes


"^619618 


3618992 


Hypothetical, unclassified, unknown


OW<1-*tOwW 


3625057 


Cell wall / LPS / capsule 


WWwWWT 


3629666 


Cell division 


W WWil^^ w 


3632683 


Transcription, RNA processing and degra


OwO^ # w r 


3633422 


Transcriptional regulators 




3635540 


Hypothetical, unclassified, unknown


3641232


3641810 


Transcriptional regulators 


'^648065 

Ww^U www 


3647754 


Adaptation, protection


3653666 



3653875 


Hypothetical, unclassified, unknown


3668522 



3667923


Hypothetical, unclassified, unknown


3668473 



3668760 


Hypothetical, unclassified, unknown


3669168 



3668839 


Hypothetical, unclassified, unknown


3671057 



3670758 


Transport of small molecules


3674323 


3673007 


Hypothetical, unclassified, unknown


3675159 


3674569 




3679945 


3680460 


Hypothetical, unclassified, unknown


3680967 


3680464 


Hypothetical, unclassified, unknown


3681047 


3681460 


Hypothetical, unclassified, unknown


3684716 


3684162 


Hypothetical, unclassified, unknown


3685761 


3684904 


Hypothetical, unclassified, unknown


3695480 


3695178 


Hypothetical, unclassified, unknown


3700315 


3700785 


Hypothetical, unclassified, unknown 


3710224 


3710679 


Putative enzymes 


3715804 


3714914 


Transport of small molecules 


3716807 


3717610 


Transport of small molecules 


3717604 


3718437 


Hypothetical, unclassified, unknown


3719328 


3720056 


Hypothetical, unclassified, unknown 


3720122 


3720622 


Hypothetical, unclassified, unknown 


3723444 


3722989 


Translation, post-translational modificatio


3730075


3729470 


Putative enzymes 


3740104 


3741018 


Hypothetical, unclassified, unknown 


3742264 


3742689 


Fatty acid and phospholipid metabolism 


3743699 


3743938 
Hypothetical, unclassified, unknown


3747455 


3747165


Transcriptional regulators 


3752477 


3752043


Hypothetical, unclassified, unknown 


3759680 


3759375


Chemotaxis 


3760819 


3759995 


Hypothetical, unclassified, unknown 


3762803 


3763126 


Hypothetical, unclassified, unknown 


3763680 


3764471 


Hypothetical, unclassified, unknown


3765086 


3764475


Secreted Factors (toxins, enzymes, algina


3772560


3771502


Hypothetical, unclassified, unknown


3778265 


3778603 


Putative enzymes


3778703 


3779368 


Hypothetical, unclassified, unknown


3779929 


3780075 


Hypothetical, unclassified, unknown


3780127 


3780312 


Hypothetical, unclassified, unknown


3787478


3787020 


Transport of small molecules




3790149 


Hypothetical, unclassified, unknown


'^7Q41 1R 

O t 53*T 1 1 \J 


3793814 


Energy metabolism 


OOV 1 1 \JO 


3801930 


Energy metabolism 


OOw 1 v7H / 


3802483 


Energy metabolism 




3802566 


Hypothetical, unclassified, unknown


ODUOOU^ 


3808317 


Transport of small molecules 


'^ftlii'^74 

OO l*TW f *T 


3813957 


Hypothetical, unclassified, unknown




3820271 


Hypothetical, unclassified, unknown




3820891 


Energy metabolism 




3822514 


Hypothetical, unclassified, unknown


'^a40747 


3840358 


Transcriptional regulators 


00*TV/O*TW 


3841736 


Related to phage, transposon, or plasmid




3842273 


Hypothetical, unclassified, unknown


^^844 104 


3843652 


Biosynthesis of cofactors, prosthetic grou


3845774 


3846334 


Biosynthesis of cofactors, prosthetic grou


3846336 


3846707 


Transport of small molecules


3849582 


3848794 


Hypothetical, unclassified, unknown


3850751 


3849603 


Hypothetical, unclassified, unknown


3851800 


3850829 


Hypothetical, unclassified, unknown


3852512 


3851919 


Adaptation, protection


3856130 


3855492 


Hypothetical, unclassified, unknown


3856336 


3856545 


Putative enzymes


3867706 


3869463 


Hypothetical, unclassified, unknown 


3882979 


3883437 


Hypothetical, unclassified, unknown 


3885710 


3886306 


Transcriptional regulators 


3890649 


3889924 


Translation, post-translational modificatio


3895323


3897356 


Hypothetical, unclassified, unknown 


3906390 


3907088 


Hypothetical, unclassified, unknown 


3907267 


3907851 


Hypothetical, unclassified, unknown 


3910738 


3911772 


Hypothetical, unclassified, unknown 


3912409 


3913131 


DNA replication, recombination, modificat


3913128


3913766 


Hypothetical, unclassified, unknown 


3913875 


3914054 


Hypothetical, unclassified, unknown 


3918468 


3918250 


Hypothetical, unclassified, unknown 


3918787


3918470 


Hypothetical, unclassified, unknown 


3922072 


3921269 


Transport of small molecules 


3928333 


3927557 


Hypothetical, unclassified, unknown 


3936836 


3935799 


Hypothetical, unclassified, unknown 


3937431 


3937237 



Transport of small molecules


3943806 


3942649


Transcription, RNA processing and degra


3948137 


3948811 


Hypothetical, unclassified, unknown 


3949852 


3950073 


Hypothetical, unclassified, unknown


3952386


3952060 


Secreted Factors (toxins, enzymes, algina 


3965841 


3967010 


Secreted Factors (toxins, enzymes, algina 


3977183 


3977833


Hypothetical, unclassified, unknown


3986879 


3987292 


Hypothetical, unclassified, unknown 


3997821 


3998114 


Carbon compound catabolism 


4003600 


4002107


Hypothetical, unclassified, unknown 


4004782 


4004958 


Hypothetical, unclassified, unknown 


4007506 


4008036 


Hypothetical, unclassified, unknown 


4010326 


4009541 


Carbon compound catabolism 


4023176 


4021971 


Hypothetical, unclassified, unknown 


4035757 


4035605 


Hypothetical, unclassified, unknown 


4036020 


4035757 


Hypothetical, unclassified, unknown 


4039173 


4040099 


Hypothetical, unclassified, unknown 


4040819 


4040103 


Transport of small molecules 


4043060 


4043830 


Hypothetical, unclassified, unknown 


4045159 


4045569 


Hypothetical, unclassified, unknown 


4045588 


4045809 


Hypothetical, unclassified, unknown 


4051557 


4051096 


DNA replication, recombination, modificat


4052603 


4051563 


Hypothetical, unclassified, unknown 


4062898 


4062425 


Hypothetical, unclassified, unknown 


4067294 


4067542 


Hypothetical, unclassified, unknown 


4068307 


4067603 


Hypothetical, unclassified, unknown 


4068611 


4068327 


Carbon compound catabolism 


4069965 


4068676 


Cell wall / LPS / capsule 


4070856 


4070011 


Nucleotide biosynthesis and metabolism 


4072487 


4070859 


Hypothetical, unclassified, unknown 


4073986 


4072658 


Fatty acid and phospholipid metabolism


4075006 


4074056 


DNA replication, recombination, modifica 


4078677 


4075156 


Cell wall / LPS / capsule 


4082180 


4081044 


Cell wall / LPS / capsule 


4082960 


4082184 


Fatty acid and phospholipid metabolism 


4083397 


4082957 


Cell wall / LPS / capsule


4084504 


4083443 


Transport of small molecules 


4086010 


4084604 


Transport of small molecules 


4087454 


4085061 


Biosynthesis of cofactors, prosthetic grou


4090093


4088903 


Fatty acid and phospholipid metabolism 


4090905 


4090090 


Cell wall / LPS / capsule 


4091654 


4090899 


Translation, post-translational modificatio


4092227


4091670 


Nucleotide biosynthesis and metabolism 


4092967 


4092230 


Translation, post-translational modificatio


4094035


4093166 


Translation, post-translational modificatio


4094906


4094166 


Translation, post-translational modificatio


4095171




Hypothetical, unclassified, unknown 


4102767 


4103060


Hypothetical, unclassified, unknown 


4103731 


4104078 


Cell wall / LPS / capsule


410474^1 


4105778 


Transport of small molecules 


411108C 


4110346 


Hypothetical, unclassified, unknown 


4114932


4115330


Transcriptional regulators 


412046t 


4121106 
Hypothetical, unclassified, unknown


4123955 


4123260


Hypothetical, unclassified, unknown 


4126089 


4125742 


Hypothetical, unclassified, unknown 


4126843 


4126163 


Hypothetical, unclassified, unknown


4130598 


4130942 


Hypothetical, unclassified, unknown 


4135972 


4135451 


Translation, post-translational modificatio


4143664 


4142569 


Chemotaxis


4148251 


4145942 


Hypothetical, unclassified, unknown 


4165879 


4165718 


DNA replication, recombination, modificat 


4172484 


4170769 


Hypothetical, unclassified, unknown


4173067 


4172528 


Hypothetical, unclassified, unknown


4182018 


4181377 


Hypothetical, unclassified, unknown


4182769 


4182074 


Hypothetical, unclassified, unknown


4183709 


4184938 


Translation, post-translational modificatio


4195357


4195007 


Transcription, RNA processing and degra


4196157 


4195399 


Transcription, RNA processing and degra


4196691 


4196164 


Translation, post-translational modificatio 


4196958


4196707 


Protein secretion/export apparatus


4198541 


4197168 


Hypothetical, unclassified, unknown


4204736 


4204542 


Hypothetical, unclassified, unknown


4205906 


4205295 


Hypothetical, unclassified, unknown


4206664 


4207164 


Putative enzymes


4209255 


4210277 


Hypothetical, unclassified, unknown


4221796 


4221212 


Hypothetical, unclassified, unknown


4224115 


4223567 


Nucleotide biosynthesis and metabolism


4227236 


4225659 


Hypothetical, unclassified, unknown


4231070 


4232227 


Transcriptional regulators


4234274 


4235182 


DNA replication, recombination, modificat


4235219 


4236598 


Transcriptional regulators


4242294 


4241341 


Hypothetical, unclassified, unknown 


4243651 


4243079 


Hypothetical, unclassified, unknown


4244184 


4243708 


Hypothetical, unclassified, unknown


4244875 


4245723 


Hypothetical, unclassified, unknown


4245809 


4246204 


Hypothetical, unclassified, unknown 


4255323 


4254736 


Hypothetical, unclassified, unknown


4260396 


4259254 


Hypothetical, unclassified, unknown 


4263492 


4262377 


Motility & Attachment 


4265287 


4264529 


Hypothetical, unclassified, unknown 


4266444 


4265305 


Nucleotide biosynthesis and metabolism 


4266900 


4266469 


Hypothetical, unclassified, unknown


4267344 


4267144 


Energy metabolism


4267709 


4267371 


Chaperones & heat shock proteins 


4269575 


4267716 


Chaperones & heat shock proteins 


4270139 


4269618 


Biosynthesis of cofactors, prosthetic grou


4270470


4270147 


Biosynthesis of cofactors, prosthetic grou


4270884


4270498 


Hypothetical, unclassified, unknown 


4272655 


4272164 


Protein secretion/export apparatus 


4278946 


4277084 


Hypothetical, unclassified, unknown 


4279344 


4279006


Hypothetical, unclassified, unknown 


4285483 


4284416


Hypothetical, unclassified, unknown 


428659^ 


4285476 


Hypothetical, unclassified, unknown 


428770S 


4286786 


Hypothetical, unclassified, unknown


4290866 


4291234 
Translation, post-translational modificatio


4291355 


4294207 


Hypothetical, unclassified, unknown 


4294604 


4294^04 


Hypothetical, unclassified, unknown


4302040 


4303050 


Hypothetical, unclassified, unknown 


4305063 


430o4Zo 


Hypothetical, unclassified, unknown


4311861 


4310950 


Hypothetical, unclassified, unknown 


4312702 


4311950 


Hypothetical, unclassified, unknown 


4314794 


4314480 


Hypothetical, unclassified, unknown 


4316037 


4315555 


Hypothetical, unclassified, unknown 


4316824 


4316108 


Putative enzymes 


4318258 


4318905


DNA replication, recombination, modifica


4330320 


4330895 


Hypothetical, unclassified, unknown 


4331451 


4332461 


Hypothetical, unclassified, unknown 


4332652 


4333035 


Transport of small molecules 


4343527 


4342121 


Hypothetical, unclassified, unknown 


4350352 


4350738 


Hypothetical, unclassified, unknown 


4351594 


4352493 


Transport of small molecules 


4354543 


4355265 


Transport of small molecules 


4356200 


4356862 


Transport of small molecules 


4366875 


4358038 


Hypothetical, unclassified, unknown 


4359074 


4358166 


Hypothetical, unclassified, unknown 


4373937 


4374332 


Hypothetical, unclassified, unknown 


4374329 


4374856 


Hypothetical, unclassified, unknown 


4374849 


4375232 


Hypothetical, unclassified, unknown 


4382015 


4381500 


Biosynthesis of cofactors, prosthetic grou 


4386351 


4385899 


Biosynthesis of cofactors, prosthetic grou


4386607 


4386356 


Biosynthesis of cofactors, prosthetic grou


4387086 


4386604 


Transport of small molecules


4414949 


4414131 


DNA replication, recombination, modificat


4418302 


4418583 


Hypothetical, unclassified, unknown 


4439376 


4439783 


Hypothetical, unclassified, unknown 


4442452 


4442868 


Transcriptional regulators 


4444977 


4446486 


Hypothetical, unclassified, unknown 


4445998 


4446390 


Hypothetical, unclassified, unknown 


4447132 


4448217 


Transcriptional regulators 


4453182 


4452535 


Biosynthesis of cofactors, prosthetic grou 


4457361 


4458644 


Hypothetical, unclassified, unknown 


4459749 


4459417 


Hypothetical, unclassified, unknown 


4461387 


4462409 


Hypothetical, unclassified, unknown 


4462399


4462881 


Translation, post-translational modificatio


4463903


4465438 


Hypothetical, unclassified, unknown 


4466753 


4466322 


Translation, post-translational modificatio


4466924


4469545 


Hypothetical, unclassified, unknown 


4469612 


4470235 


DNA replication, recombination, modificat


4470273


4471310 


Hypothetical, unclassified, unknown 


4471402 


4471554 


Related to phage, transposon, or plasmi






Biosynthesis of cofactors, prosthetic grou


4477976


4476993 


Hypothetical, unclassified, unknown 


4478907 


4478626 


Cell wall / LPS / capsule 


4483363 


4482260 


Hypothetical, unclassified, unknown 


44861 7S 


4485823


Hypothetical, unclassified, unknown 


4486846 


4486202


Putative enzymes 


446840S 


4469632






Secreted Factors (toxins, enzymes, algina


4713795


4714283


Secreted Factors (toxins, enzymes, algina


4714313


4714801




Secreted Factors (toxins, enzymes, algina


4714826


4716042 




Secreted Factors (toxins, enzymes, algina


4718666


4719392 




Secreted Factors (toxins, enzymes, algina


4719418


4720062 




Hypothetical, unclassified, unknown


4724034


4722850 




Transport of small molecules


4744818


4745123 




DNA replication, recombination, modificat


4747136 


4746639 




Translation, post-translational modificatio


4754378 


4753989 




Transcription, RNA processing and degra


4755423 


4754422 




Translation, post-translational modificatio


4756066 


4755446 




Translation, post-translational modificatio


4756472 


4756083 




Translation, post-translational modificatio


4756847 


4756491 




Translation, post-translational modificatio


4757094 


4756978 




Protein secretion/export apparatus


4758451 



4757123 




Translation, post-translational modificatio


4758886 


4758452 




Translation, post-translational modificatio


4759066


4758890 




Translation, post-translational modificatio


4753569


4759069 




Translation, post-translational modificatio


4753323


4759573 




Translation, post-translational modificatio


1 il7fin467 

I *T # wW*tW r 


4759934 




Translation, post-translational modificatio


4760871


4760479 


Translation, post-translational modificatio




4761061 


Translation, post-translational modificatio


4761919


4761380 


Translation, post-translational modificatio


4762253


4761939 


Translation, post-translational modificatio


T 476263^ 


4762266 


Translation, post-translational modificatio


4762924


4762658 


Translation, post-translational modificatio


4763118


4762927 


Translation, post-translational modificatio


n 476353' 

1 *T / W WWW 1 


4763118 


Translation, post-translational modificatio


4764229


4763543 


Translation, post-translational modificatio


4764574


4764242 


Translation, post-translational modificatio


4764862


4764587 


Translation, post-translational modificatio


4765700


4764879 


Translation, post-translational modificatio


4766011


4765712 


Translation, post-translational modificatio


4766610


4766008 


Translation, post-translational modificatio


4767259


4766624 


Translation, post-translational modificatio


4767653


4767342


Translation, post-translational modificatio


4771655


4771185 


Translation, post-translational modificatio 


4772126


4771755 


Transcription, RNA processing and degra


4776477 


4772278 


Transcription, RNA processing and degra


4780616 


4776543 


Translation, post-translational modificatio


4781206


4780838 


Translation, post-translational modificatio


4781785


4781285 


Translation, post-translational modificatio


4782679


4781984 


Translation, post-translational modificatio


)l 478311C 


4782679


Transcription, RNA processing and degra


i 478376C 


4783227


Protein secretion/export apparatus 


4784138 


HIO'Sl l\J 


Hypothetical, unclassified, unknown 


478747E 


4786733


Hypothetical, unclassified, unknown 


4819927 


4820409


Two-component regulatory systems 


4820531 


4821358 


Hypothetical, unclassified, unknown 


482336c 


4823079 


Hypothetical, unclassified, unknown 


482412; 


4823386 


Hypothetical, unclassified, unknown 


483055: 


4829642 
Hypothetical, unclassified, unknown



Nucleotide biosynthesis and metabolism 



4830963 



4843551 



4831181 



4842700 



Hypothetical, unclassified, unknown



Hypothetical, unclassified, unknown



4851309 



4852316 



4854008 



4853649 
4858410 



Carbon compound catabolism



4856959 



Putative enzymes 
Transcriptional regulators 



Hypothetical, unclassified, unknown 



4859262 



4858489 



4870327 



4869557 



4875244 



4874654 



Hypothetical, unclassified, unknown 



4877605 



4876820 



Hypothetical, unclassified, unknown



Hypothetical, unclassified, unknown 



4878586 



4877690 



4878789 



4879544 



Hypothetical, unclassified, unknown



4882052 



4882354 



Hypothetical, unclassified, unknown 



4884960 



4884718 



Hypothetical, unclassified, unknown



4887505 



4887278 



Adaptation, protection



Hypothetical, unclassified, unknown 



4893696 



4894277 



4902201 



4903295 



Hypothetical, unclassified, unknown 



4909403 



4909161 



Hypothetical, unclassified, unknown 



Chaperones & heat shock proteins



4914509 



4914126 



4917123 



4915480 



Chaperones & heat shock proteins



4917467 



4917174 



Hypothetical, unclassified, unknown



4918932 



4918198 



Putative enzymes 



4919041 



4919799 



Hypothetical, unclassified, unknown



4921980



4922363 



Hypothetical, unclassified, unknown 
Protein secretion/export apparatus



4925794 



4925315 



4936616 



4933865 



Hypothetical, unclassified, unknown



4937819 



4938214 



Cell wall / LPS / capsule



4939186 



4938275 



Cell division



4940483 



4939299 



Cell division 



4941787 



4940534 



Cell division 



4942672 



4941809 



Cell wall / LPS / capsule



4945074 



4943632 



Cell wall / LPS / capsule



Cell division 



4946140 



4945067 



4947329 



4946130 



Cell wall / LPS / capsule



4948675 



4947329 



Cell wall / LPS / capsule



Cell wall / LPS / capsule



4949771 



4948689 



4951147 



4949771 



Cell wall / LPS / capsule



4952603 



4951140 



Cell wall / LPS / capsule



4954342 



4952603 



Cell division



4954632 



4954339 



Hypothetical, unclassified, unknown



4955570 



4954629 



Hypothetical, unclassified, unknown



4956028 



4955573 



Hypothetical, unclassified, unknown
Putative enzymes 
Adaptation, protection



4959519 



4959896 



4959925 



4960518 



4961557 



4961150 



Adaptation, protection



4962186 



4961569 



Energy metabolism 



4964265 



4963054 



Translation, post-translational modificatio



4965500



4965108 



Translation, post-translational modificatio



4965943



4965515 



Transcriptional regulators



4968711 



4969610 



Hypothetical, unclassified, unknown



4971890 



4970796 



Translation, post-translational modificatio



4973331



4971985 



Hypothetical, unclassified, unknown



4974028 



4973399 
Cell wall / LPS / capsule


4985469


4984204


Hypothetical, unclassified, unknown


4986154


4985846



Hypothetical, unclassified, unknown


4986798 


49861 5H 


Hypothetical, unclassified, unknown


4987283 


4986810 


Transport of small molecules


4988081 


4987284


Hypothetical, unclassified, unknown


4989304 


4990284


Hypothetical, unclassified, unknown


4990832 


4991404


Hypothetical, unclassified, unknown


4991391 


4991918 


Transport of small molecules


4991918 


4992643 


Transcriptional regulators


4992869 


4994362 


Hypothetical, unclassified, unknown


4994440 


4994748 


Transport of small molecules


4994762 


4995226 


Transport of small molecules


4996118 


4996390 


Hypothetical, unclassified, unknown


5000303 


4999908 


Cell division


5012313 


5011321 


Cell division 


5013430 


5012393 


Translation, post-translational modificatio


5013670


5013960 


Translation, post-translational modificatio


5013973


5015427 


Translation, post-translational modificatio


5015534


5016979 


Hypothetical, unclassified, unknown


5017039 


5017416


Hypothetical, unclassified, unknown


5028230 


5027421 


Transcriptional regulators


5036244 


5036807 


Hypothetical, unclassified, unknown


5046663 


5046031 


Biosynthesis of cofactors, prosthetic grou


5068031 


5068879 


Motility & Attachment


5069530 


5069081 


Motility & Attachment


5069762 


5071462 


Motility & Attachment


5071566 


5072690 


Hypothetical, unclassified, unknown


5073563 


5074174 


Hypothetical, unclassified, unknown


5074171 


5074371 


Hypothetical, unclassified, unknown


5077535 


5077705 


Transcription, RNA processing and degra


5091814 


5090852 


Two-component regulatory systems


5094984 


5096321


Motility & Attachment


5099262 


5100086 


Motility & Attachment


5100083 


5100670 


Adaptation, protection


5105929 


5104985 


Translation, post-translational modificatio


5106957


5106448 


Translation, post-translational modificatio


5109781


5106950 


Biosynthesis of cofactors, prosthetic grou


5110743 


5109805 


Translation, post-translational modificatio


5112662


5112937 


Hypothetical, unclassified, unknown


5113468 


5113004 


Amino acid biosynthesis and metabolism


5114598 


5113480 


Adaptation, protection


5115889 


5114669 


Translation, post-translational modificatio


5116288


5116031 


Translation, post-translational modificatio


5116623


5116312 


Biosynthesis of cofactors, prosthetic grou


5116864


5117832 


Hypothetical, unclassified, unknown 


5122386 


5121898 


Hypothetical, unclassified, unknown 


5122901 


5122560 


Hypothetical, unclassified, unknown 


512593C 


5125604


Hypothetical, unclassified, unknown 


5135816 


5136192


Transcriptional regulators 


515556C 


5156123


Hypothetical, unclassified, unknown 


5162446 


5162068 


Hypothetical, unclassified, unknown 


516875^ 


5169188 
Hypothetical, unclassified, unknown


5169504


5169250


Hypothetical, unclassified, unknown


5174979 


5176103


Hypothetical, unclassified, unknown 


5197880 


5198323


Hypothetical, unclassified, unknown


5204927 


5206087 


Hypothetical, unclassified, unknown


5206486 


5206208 


Hypothetical, unclassified, unknown 


5207006 


5206719 


Hypothetical, unclassified, unknown


5210995 


5210705 


Hypothetical, unclassified, unknown


5212104 


5211631 


Nucleotide biosynthesis and metabolism


5212832 


5213470 


Hypothetical, unclassified, unknown


5215669 


5216202 


Chaperones & heat shock proteins


5216763 


5217651 


Biosynthesis of cofactors, prosthetic grou


5223561 


5222539 


Cell wall / LPS / capsule


5230910 


5230113 


Biosynthesis of cofactors, prosthetic grou


5231658 


5230900 


Translation, post-translational modificatio


5233566


5232484 


Translation, post-translational modificatio


5234852


5233584 


Cell wall / LPS / capsule


5236773 


5237390 


Biosynthesis of cofactors, prosthetic grou


5237392 


5238240 


Nucleotide biosynthesis and metabolism 


5238407 


5239348 


Translation, post-translational modificatio


5239465


5240079 


Translation, post-translational modificatio


5240121


5240705 


Hypothetical, unclassified, unknown


5242558 


5242253 


Putative enzymes


5246122 


5245475 


Hypothetical, unclassified, unknown


5248771 


5248070 


Hypothetical, unclassified, unknown


5250617 


5249598 


Fatty acid and phospholipid metabolism


5272299 


5271484 


Amino acid biosynthesis and metabolism


5275731 


5274007 


Hypothetical, unclassified, unknown


5276282 


5276734 


Hypothetical, unclassified, unknown


5277171 


5276842 


Hypothetical, unclassified, unknown


5277950 


5277171 


Hypothetical, unclassified, unknown


5282158 


5282505 


Transport of small molecules


5286034 


5285267 


Hypothetical, unclassified, unknown


5291613 


5291960


Hypothetical, unclassified, unknown


5297482 


5297006


Biosynthesis of cofactors, prosthetic grou


5310725 


5311213 


Biosynthesis of cofactors, prosthetic grou


5311463 


5312263 


Biosynthesis of cofactors, prosthetic grou


5313205 


5313585 


Energy metabolism


5313675 


5315339 


Hypothetical, unclassified, unknown 


5322234 


5322449 


Hypothetical, unclassified, unknown 


5322706 


5322509 


Hypothetical, unclassified, unknown 


5323100 


5322756 


Transcription, RNA processing and degr


5325478


5323373 


Translation, post-translational modificatio


5325921


5325652 


Translation, post-translational modificatio


5329948


5327426 


Transcription, RNA processing and degr


5331457


5329976 


Hypothetical, unclassified, unknown 


5331 96C 


Www 1 Sj\J£m 


Protein secretion/export apparatus 


5332742 


5332353


Energy metabolism 


533350C 


5332745


Cell wall / LPS / capsule 


5334902 


5333566


Biosynthesis of cofactors, prosthetic grou


5335771


5334920 


Cell division 


5338522 


5337899 


Hypothetical, unclassified, unknown 5338611


5338931


Hypothetical, unclassified, unknown



5343754 5343104



Amino acid biosynthesis and metabolism



Chaperones & heat shock proteins 



5345891 5345085
5349760 5349200



Transcriptional regulators 



5352078 



5351674



Transport of small molecules 



5352176 



5352706



Hypothetical, unclassified, unknown 



5353506 



5353072



Hypothetical, unclassified, unknown 



5361585 



5362067



Two-component regulatory systems 



5364070 



5364735 



Transcriptional regulators 



5366256 5366654



Hypothetical, unclassified, unknown



5370720 



5370475



Hypothetical, unclassified, unknown



5376851 



5377708 



Hypothetical, unclassified, unknown



5377790 



Hypothetical, unclassified, unknown 



5378092 



5378095 
5378841



Hypothetical, unclassified, unknown



Related to phage, transposon, or plasmid



5380447 



5379512



5383811 



5382795 
5387721 



Hypothetical, unclassified, unknown



5386999 



Energy metabolism 



5396858 



5395929



Fatty acid and phospholipid metabolism



5402944 



5402015



Hypothetical, unclassified, unknown



5414487 



5414278



Hypothetical, unclassified, unknown



5418568 



5418350 



Hypothetical, unclassified, unknown



5419862



5420317 
5423065



Transcriptional regulators 



5422505 



Hypothetical, unclassified, unknown



5434651 



5434115 



Fatty acid and phospholipid metabolism 5442303



5442773



Fatty acid and phospholipid metabolism



Translation, post-translational modificatio



5442791 
5445198 



5444140



5446082



Transcription, RNA processing and degra 5448642
5460525



5448965 



Transport of small molecules 



5461382 
5463604



Central intermediary metabolism 



5462762 



Hypothetical, unclassified, unknown



5463917 



5464435



Central intermediary metabolism 



5464820 



5466520 
5468016



Hypothetical, unclassified, unknown 



5468282 



Hypothetical, unclassified, unknown



5468408 



5469022 
5471625



Hypothetical, unclassified, unknown 



5472041 



Hypothetical, unclassified, unknown



5472438 



5472734



Transcriptional regulators 



5473765 



5474577 



Two-component regulatory systems 



5480401 



5481090 
5482451 



Transport of small molecules 



5483767 



Hypothetical, unclassified, unknown 



Biosynthesis of cofactors, prosthetic grou



5486355 
5487714 



5486984



Hypothetical, unclassified, unknown 



5489040 



5488385 
5489612



Transcriptional regulators 



5490651 



5489629 
5505070 



Transcriptional regulators 



5505783 



Hypothetical, unclassified, unknown 



5507897 



5506965



Hypothetical, unclassified, unknown 



5517093 



5516398



Biosynthesis of cofactors, prosthetic grou 5519710

5522385



5520537 
5522972



Hypothetical, unclassified, unknown 



Hypothetical, unclassified, unknown 



5524718 



5523867



Hypothetical, unclassified, unknown 



5525904 



5524969



DNA replication, recombination, modificat 
Translation, post-translational modificatio



5535555 



5534161



5537288 



Translation, post-translational modificatio 5537737



5537058 
5537318


Nucleotide biosynthesis and metabolism 5543364


5542072


Hypothetical, unclassified, unknown


5544820 


5544635


Hypothetical, unclassified, unknown


5548644 


5548396 


Translation, post-translational modificatio


5549721 


5548750 


Hypothetical, unclassified, unknown


5553583 


5553116 


Hypothetical, unclassified, unknown


5556934 


5557953 


Central intermediary metabolism


5561598 


5562413 


Hypothetical, unclassified, unknown


5569089 


5570627 


Hypothetical, unclassified, unknown


5570624 


5571160 


DNA replication, recombination, modificat


5574485 


5572221 


Hypothetical, unclassified, unknown


5575017 


5574493 


Hypothetical, unclassified, unknown


5576027 


5575014 


DNA replication, recombination, modificat


5577916 


5576027 


Hypothetical, unclassified, unknown


5579498 


5578680 


Hypothetical, unclassified, unknown


5580961 


5581707 


Putative enzymes


5593423 


5592632 


Cell wall / LPS / capsule


5606097 


5603820 


Antibiotic resistance and susceptibility


5606102 


5606434 


Hypothetical, unclassified, unknown


5606495 


5607670 


Hypothetical, unclassified, unknown


5607667 


5608479 


Transport of small molecules


5615543 


5613732 


Hypothetical, unclassified, unknown


5615651 


5616301 


Hypothetical, unclassified, unknown


5626378 


5624900 


Hypothetical, unclassified, unknown


5627133 


5626375 


Hypothetical, unclassified, unknown


5627864 


5627130 


Cell wall / LPS / capsule


5628670 


5627864 


Cell wall / LPS / capsule


5629788 


5628667


Cell wall / LPS / capsule


5630852 


5629785 


Cell wall / LPS / capsule


5631886 


5630849 


Hypothetical, unclassified, unknown


5659184 


5668417 


Transcriptional regulators


5662741 


5663814 


Hypothetical, unclassified, unknown


5663888 


5664868 


Biosynthesis of cofactors, prosthetic grou


5666056 


5664989 


Amino acid biosynthesis and metabolism


5675701 


5675183 


Motility & Attachment


5680712 


5679648 


DNA replication, recombination, modifica


5687495 


5689714 


Translation, post-translational modificatio


5689963


5691726 


Hypothetical, unclassified, unknown


5691761 


5692456


Hypothetical, unclassified, unknown


5702113 


5701697 


Biosynthesis of cofactors, prosthetic grou


5702668


5703438 


Hypothetical, unclassified, unknown


5703453 


5704079 


Hypothetical, unclassified, unknown


5704076 


5705677 


Amino acid biosynthesis and metabolism


5706190


5706525 


Protein secretion/export apparatus 


5706551 


5706799 


Hypothetical, unclassified, unknown 


• 570803 


> 5708742 


Chemotaxis 


570895^ 


5710897


Hypothetical, unclassified, unknown 


572046£ 


5720944 


Transcriptional regulators


5723581 


5724537 


Carbon compound catabolism 


575347; 


5752463 


Central intermediary metabolism 


575414: 


5753613


Transcriptional regulators


5762145 


) 576257 


Amino acid biosynthesis and metabolism


1 576648: 


5767892


Protein secretion/export apparatus


5777096 5776605


Nucleotide biosynthesis and metabolism 


5777387 5777133


Hypothetical, unclassified, unknown 


5777808


5777389 


Carbon compound catabolism 


5778133 


5779680 


Hypothetical, unclassified, unknown 


5779994 


5780812 


Amino acid biosynthesis and metabolism 


5791085 


5790444 


Hypothetical, unclassified, unknown


5791836 


5792234 


Hypothetical, unclassified, unknown 5797064


5797336 


Transport of small molecules 




5802823 


Cell wall / LPS / capsule 


OO lUilOU 


5811338 


Cell wall / LPS / capsule 


Otj \ 1 ooo 


5812243


Cell wall / LPS / capsule 


OO 1 ^^hv 


5813121 


Cell wall / LPS / capsule


OO lO lil 1 


5813666 


Amino acid biosynthesis and metabolism




5825718 


Hypothetical, unclassified, unknown 


CQOQif ftO 

oo^yAoy 


5828903 


Hypothetical, unclassified, unknown 


oooUf 4y 


5830312 


Hypothetical, unclassified, unknown 


booo4ol 


5835894 


Putative enzymes 


5840894 


5839104 


Putative enzymes 


5843799 


5843197 


Chaperones & heat shock proteins 


p84oODO 


5847971 


Putative enzymes 


5879970 


5878753 


Hypothetical, unclassified, unknown


5880481 


5879999 


Translation, post-translational modificatio


5883012


5881678 


Hypothetical, unclassified, unknown 


OOOoOf O 


5883022 


Hypothetical, unclassified, unknown 


5883971 


5884285 


Hypothetical, unclassified, unknown 


5885485 


5885943 


Hypothetical, unclassified, unknown 


5896726 


5895260 


Transcription, RNA processing and degra


5900123 


5898864 


Energy metabolism 


5900694 


5900368 


Hypothetical, unclassified, unknown 


5907389 


5907862 


Hypothetical, unclassified, unknown 


5908329 


5907847 


Biosynthesis of cofactors, prosthetic grou 


5921496 


5920741 


Biosynthesis of cofactors, prosthetic grou


5922434 


5921493 


Hypothetical, unclassified, unknown


5941081 


5940746 


Cell wall / LPS / capsule 


5941335 


5941475 


Amino acid biosynthesis and metabolism


5942744 


5943574 


Putative enzymes 


5945234 


5945932 


Central intermediary metabolism


5952820 


5952482 


DNA replication, recombination, modifica


5962715


5964724 


Energy metabolism 5969764


5969354 


Hypothetical, unclassified, unknown 


5972202 


5971849 


Translation, post-translational modificatio


5986118


5985882 


DNA replication, recombination, modifica


\\ 598931 S 


5988645 


DNA replication, recombination, modifica


1 598945S 


5990667 


Nucleotide biosynthesis and metabolism


599067E 


5991130


Hypothetical, unclassified, unknown 


5996035 


5996985


Energy metabolism 


6000142 


5999753


Hypothetical, unclassified, unknown 


6001377 


6000760


Nucleotide biosynthesis and metabolism


600203£ 


6001398


Hypothetical, unclassified, unknown 


6003325 


6002958


Transcription, RNA processing and degr


d 600409( 


6003376


Hypothetical, unclassified, unknown


600427( 


3 6005139


Nucleotide biosynthesis and metabolism 6005198


OUOOOU^ 


Hypothetical, unclassified, unknown 6008397


oOOof / f 


Hypothetical, unclassified, unknown


6017013 


601 0021 


Carbon compound catabolism 


6018996 


601oo29 


Carbon compound catabolism 


6019347 


6019180


Biosynthesis of cofactors, prosthetic grou


6025304 


6026194 


Two-component regulatory systems 


6032267 


6031365 


Hypothetical, unclassified, unknown


6060199 


6059822 


Hypothetical, unclassified, unknown 


6063831 


6063352 


Putative enzymes 


6069098 


6067944 


Hypothetical, unclassified, unknown


6074229 


6075206 


Transcriptional regulators 6082866


6083072 


Hypothetical, unclassified, unknown


6083104 


6083508 


Hypothetical, unclassified, unknown


6083752 


6084084 


Hypothetical, unclassified, unknown


6084644 


6084368 


Carbon compound catabolism


6096706 


6097026 


Hypothetical, unclassified, unknown


6148318 


6149181 


Hypothetical, unclassified, unknown


6151313 


6151525 


Hypothetical, unclassified, unknown


6155202 


6154783 


Hypothetical, unclassified, unknown


6158178 


6158942 


Translation, post-translational modificatio


6159562


6158948 


Hypothetical, unclassified, unknown


6171732 


6171926 


Hypothetical, unclassified, unknown


6172872 


6172711 


Hypothetical, unclassified, unknown


6188864 


6188165 


Transport of small molecules


6195308 


6196315 


Hypothetical, unclassified, unknown


6219068 


6218856 


Transport of small molecules


6221100 


6222857 


Transport of small molecules 


6225924 


6224896 


Hypothetical, unclassified, unknown 


6227081 


6227449 


Hypothetical, unclassified, unknown 


6228243 


6227602 


Hypothetical, unclassified, unknown 


6236228 


6235830 


Central intermediary metabolism 


6244945 


6243110 


Cell wall / LPS / capsule 


6247689 


6246325 


Energy metabolism 


6248235 


6247810 


Energy metabolism 


6249653 


6248277 


Energy metabolism 


6250544 


6249684 


Energy metabolism 


6252139 


6250595 


Energy metabolism 


6252694 


6252158 


Energy metabolism 


6253176 


6252706 


Energy metabolism 


6253491 


6253234 


Energy metabolism 


6254410 




Energy metabolism 


6254807 


6254427 


Cell division 


6255843 


6254971 


Hypothetical, unclassified, unknown 


6260053 


6259670 


Hypothetical, unclassified, unknown 


6263563 


6261827 


Translation, post-translational modificatio


6264211


6263804 


Translation, post-translational modificatio


6264360


6264226


4269


4200 


0.9999719 


rpoC


4270


4074


0.9999719


rpoB


3640


3522


0.9999058


dnaE


1156


2892


0.9995054


nrdA


3834


2853


0.9994519


valS


4560


2832


0.9994208


ileS


595


2775


0.999327


ostA


3168


acetyl-CoA acetyltransferase

probable molybdopterin biosynthesis protein MoeB

conserved hypothetical protein

hypothetical protein

alginate biosynthesis protein Alg44

hypothetical protein

probable hydrolase

succinyl-CoA synthetase beta chain

hypothetical protein

probable ATP-binding component of ABC transporter

probable acyl-CoA dehydrogenase

phosphoglycerate kinase

hypothetical protein

hypothetical protein

probable RND efflux membrane fusion protein precursor

probable peptidic bond hydrolase

probable multidrug resistance efflux pump

succinyl-diaminopimelate desuccinylase

conserved hypothetical protein

probable acyl-CoA dehydrogenase

general secretion pathway protein L

probable permease of ABC transporter

erythronate-4-phosphate dehydrogenase

conserved hypothetical protein

hypothetical protein

probable acyl-CoA thiolase

conserved hypothetical protein

lipid A-disaccharide synthase

LPS biosynthesis protein WbpG

hypothetical protein

tRNA methyltransferase

probable acyl-CoA dehydrogenase

conserved hypothetical protein

conserved hypothetical protein

still frameshift type 4 fimbrial biogenesis protein PilC

cytochrome c oxidase, subunit II

conserved hypothetical protein

riboflavin-specific deaminase/reductase

probable glycosyltransferase WbpH

glycine cleavage system protein T2

muconate cycloisomerase I

UDP-glucose:(heptosyl) LPS alpha 1,3-glucosyltransferase WaaG

hypothetical protein

glutamate 5-kinase

conserved hypothetical protein

hypothetical protein

conserved hypothetical protein

hypothetical protein

aspartate semialdehyde dehydrogenase

rod shape-determining protein

alcohol dehydrogenase (Zn-dependent)

conserved hypothetical protein

hypothetical protein



peptide chain release factor 2
hypothetical protein
conserved hypothetical protein
hypothetical protein
conserved hypothetical protein



3-phosphoserine aminotransferase
peptide chain release factor 1 



phospho-N-acetylmuramoyl-pentapeptide-transferase



hypothetical protein 



probable aminotransferase WbpE 



phospho-2-dehydro-3-deoxyheptonate aldolase



O6-methylguanine-DNA methyltransferase



Sp^N-aS N-acetylglucosamlne ; 



uroporphyrinogen decarboxylase 



hypothetical protein



heptosyltransferase



conserved hypothetical protein 



fructose-1,6-bisphosphate aldolase



type 4 fimbrial biogenesis protein PilM 



probable UDP-N-acetylglucosamine 2-epimerase WbpI



phosphoribosylaminoimidazole synthetase 



UDP-3-O-[3-hydroxylauroyl] glucosamine N-acyltransferase



quinolinate synthetase A 



probable secretion protein 



probable alcohol dehydrogenase (Zn-dependent) 



conserved hypothetical protein 



dTDP-D-glucose 4,6-dehydratase



outer membrane protein OprF precursor 



hypothetical protein 



conserved hypothetical protein 



translocation protein in type III secretion 
O-antigen chain length regulator 



TolA protein 



transcriptional regulator MexT 



conserved hypothetical protein 



acetylpolyamine aminohydrolase 



conserved hypothetical protein 



RecA protein 



conserved hypothetical protein 



heptosyltransferase II



hypothetical protein 



rod shape-determining protein MreB



hypothetical protein 



DNA polymerase III, delta subunit 



tetrahydrodipicolinate succinylase 

probable binding protein component of ABC transporter 



hypothetical protein 



conserved hypothetical protein 



WO 03/089572 PCT/US02/35518




TonB protein

O-sialoglycoprotein endopeptidase

ferrochelatase

conserved hypothetical protein

probable aminotransferase

probable transmembrane sensor

UDP-N-acetylpyruvoylglucosamine reductase

hypothetical protein

acetoin catabolism protein AcoB

conserved hypothetical protein

probable oxidoreductase

hypothetical protein

glycosyltransferase WbpL

hypothetical protein

probable transposase

flagellar motor switch protein FliG

hypothetical protein

probable transposase

probable transposase

probable transposase

probable transposase

phenylalanyl-tRNA synthetase, alpha-subunit

probable permease of ABC transporter

hypothetical protein

hypothetical protein

fructose-1,6-bisphosphatase

conserved hypothetical protein

probable ATP-binding component of ABC transporter

conserved hypothetical protein

hypothetical protein

probable O-methyltransferase

aspartate carbamoyltransferase

glyceraldehyde 3-phosphate dehydrogenase

conserved hypothetical protein

DNA-directed RNA polymerase alpha chain

probable pyruvate dehydrogenase E1 component, beta chain

tetraacyldisaccharide 4'-kinase

rod shape-determining protein MreC

D-lactate dehydrogenase (fermentative)

sulfate transport protein CysA

probable nucleoside hydrolase

pyridoxal phosphate biosynthetic protein PdxA

probable transmembrane sensor

DNA polymerase III, delta prime subunit

L-asparaginase I

hypothetical protein

probable bacteriophage integrase

lipoate synthase

hypothetical protein

conserved hypothetical protein

hypothetical protein

probable transcriptional regulator

conserved hypothetical protein



conserved hypothetical protein



delta 2-isopentenylpyrophosphate transferase
octaprenyl-diphosphate synthase



hypothetical protein



hypothetical protein



probable enoyl-CoA hydratase/isomerase
thiamine monophosphate kinase



hypothetical protein



hypothetical protein



hypothetical protein

probable serine/threonine dehydratase, degradative



pseudouridine synthase



conserved hypothetical protein 



hypothetical protein



ribosomal large subunit pseudouridine synthase C 



probable transcriptional regulator 



hypothetical protein 



probable transmembrane sensor 



hypothetical protein



probable transcriptional regulator 



probable transcriptional regulator 



glutathione synthetase



hypothetical protein



probable adhesion protein 



acetyl-coenzyme A carboxylase carboxyl transferase (alpha subunit)



probable transcriptional regulator 



probable NAD-dependent epimerase/dehydratase WbpK 



probable oxidoreductase WpbB 



probable transmembrane sensor 



hypothetical protein 



glycyl-tRNA synthetase alpha chain 



probable lipase 



LytB protein 



conserved hypothetical protein



probable hydrolase 



ribose-phosphate pyrophosphokinase 



hypothetical protein 



conserved hypothetical protein 



porphobilinogen deaminase 

probable transcriptional regulator 



probable lauroyl acyltransferase



transcriptional regulator Pb<R 



probable transcriptional regulator 



hypothetical protein 



malonyl-CoA-[acyl-carrier-protein] transacylase



riboflavin kinase/FAD synthase 



conserved hypothetical protein 



conserved hypothetical protein



probable phosphatidate cytidylyltransferase 



hypothetical protein



carbamate kinase

hypothetical protein



probable transcriptional regulator
catechol 1,2-dioxygenase



D-alanyl-D-alanine-endopeptidase 



probable transcriptional regulator 



hypothetical protein 



probable epimerase



lipase LipC



probable transcriptional regulator 



electron transfer flavoprotein alpha-subunit 



FdhE protein 



hypothetical protein



hypothetical protein



conserved hypothetical protein 



probable ATP-binding component of ABC transporter



probable cytochrome c



probable transcriptional regulator 



hypothetical protein



probable transcriptional regulator



probable permease of ABC transporter 



hypothetical protein



probable transcriptional regulator



probable transcriptional regulator 



probable transcriptional regulator 



GTP-binding protein Era

probable transcriptional regulator 



pyrroloquinoline quinone biosynthesis protein B



probable cytochrome c oxidase assembly factor 



hypothetical protein 



hypothetical protein



probable short chain dehydrogenase 



hypothetical protein 



hypothetical protein



hypothetical protein 



hypothetical protein 



UDP-3-O-acyl-N-acetylglucosamine deacetylase



probable transcriptional regulator 



probable transcriptional regulator 



conserved hypothetical protein 



probable binding protein component of ABC transporter 



dTDP-4-dehydrorhamnose reductase 



conserved hypothetical protein 



probable transcriptional regulator 



hypothetical protein 



hypothetical protein 



probable transcriptional regulator 



probable transferase 

probable two-component response regulator 



probable transcriptional regulator 



hypothetical protein



cysteine synthase B 



probable glycosyl transferase

hypothetical protein



probable transcriptional regulator



probable transcriptional regulator 



probable transcriptional regulator 



probable transcriptional regulator 



probable transcriptional regulator 



cytochrome o ubiquinol oxidase protein CyoE
probable 3-hydroxyisobutyrate dehydrogenase
probable transcriptional regulator 



4-hydroxybenzoate-octaprenyl transferase 



probable chemotaxis protein 



conserved hypothetical protein 



probable 2-OH-lauroyltransferase



probable transcriptional regulator 



succinyl-CoA synthetase alpha chain



conserved hypothetical protein 



geranyltranstransferase



hypothetical protein



ribosomal protein L11 methyltransferase



glucose-1-phosphate thymidylyltransferase



conserved hypothetical protein 



hypothetical protein



dihydrodipicolinate synthase 



heat shock protein HtpX



methyltransferase PilK 



probable transcriptional regulator 



conserved hypothetical protein 



hypothetical protein



chromosome partitioning protein Spo0J



hypothetical protein 



acetyl-CoA carboxylase beta subunit 



elongation factor Ts 



cell division protein ZipA 



ATP synthase A chain 



outer membrane protein PopN 



lipase modulator protein



hypothetical protein



probable transcriptional regulator 



conserved hypothetical protein 



hypothetical protein 



cell division protein FtsQ 



malonate decarboxylase beta subunit 



ATP synthase gamma chain 



probable hydrolase 



hypothetical protein 



probable short-chain dehydrogenase 



spermidine synthase 



probable ATP-binding component of ABC transporter 



hypothetical protein 



hypothetical protein 



probable ATP-binding component of ABC transporter



hypothetical protein

probable transcriptional regulator

5,10-methylene-tetrahydrofolate dehydrogenase / cyclohydrolase
Sigma factor RpoH 



signal peptidase I

hypothetical protein



formyltetrahydrofolate deformylase



dihydropteroate synthase 

conserved hypothetical protein 



probable potassium channel



isopentenyl monophosphate kinase



hypothetical protein



hypothetical protein



nicotinate-nucleotide pyrophosphorylase 



conserved hypothetical protein 



hypothetical protein



2-dehydro-3-deoxyphosphooctonate aldolase 



probable transmembrane sensor 



urease accessory protein 



flagellar synthesis regulator FleN 



probable ATP-binding component of ABC transporter 



probable phenazine biosynthesis protein 



ATP-binding component of ABC phosphonate transporter



probable ATP-binding component of ABC transporter 



probable permease of ABC transporter 



probable short-chain dehydrogenase 



diaminopimelate epimerase 



probable two-component response regulator 



conserved hypothetical protein 



NH3-dependent NAD synthetase



NosY protein 



probable biotin synthesis protein BioC 



probable transcriptional regulator 



type 4 fimbrial biogenesis protein PilW 



probable chemotaxis protein methyltransferase 



conserved hypothetical protein 



50S ribosomal protein L2



hypothetical protein



probable oxidoreductase 



probable permease of ABC taurine transporter 



conserved hypothetical protein 



hypothetical protein 



hypothetical protein 



phosphatidate cytidylyltransferase 



phosphatidylserine synthase 



hypothetical protein 



thiosulfate sulfurtransferase 



hypothetical protein 



conserved hypothetical protein 



hypothetical protein 



hypothetical protein 



hypothetical protein 



probable transcriptional regulator

probable permease of ABC transporter



conserved hypothetical protein 



probable transcriptional regulator



hypothetical protein



dihydrodipicolinate reductase
hypothetical protein 



lipopolysaccharide core biosynthesis protein WaaP 



tryptophan synthase alpha chain 



probable ATP-binding component of ABC transporter



hypothetical protein 



probable permease of ABC-2 transporter 



probable transcriptional regulator 



conserved hypothetical protein 



prolipoprotein diacylglyceryl transferase



3-methyl-2-oxobutanoate hydroxymethyltransferase



conserved hypothetical protein



probable permease of ABC transporter 



probable permease of ABC transporter 



probable enoyl-CoA hydratase/isomerase 



probable transcriptional regulator 



conserved hypothetical protein 



thymidylate synthase



hypothetical protein 



hypothetical protein



probable enoyl-CoA hydratase/isomerase 



translocation protein in type III secretion



hypothetical protein 

probable pili assembly chaperone
probable transcriptional regulator 



probable plasmid partitioning protein 



probable permease of ABC transporter 



conserved hypothetical protein 



methionine aminopeptidase 



conserved hypothetical protein 



hypothetical protein 



hypothetical protein 



probable transcriptional regulator 



probable permease of ABC-2 transporter 



probable NAD(P)H dehydrogenase 



conserved hypothetical protein 



hypothetical protein 



ferredoxin-NADP+ reductase 



UDP-N-acetylglucosamine acyltransferase



probable permease of ABC transporter 



conserved hypothetical protein 



probable enoyl-CoA hydratase/isomerase 



polyamine transport protein PotC 



ubiquinone biosynthesis methyltransferase UbiE 
hypothetical protein



transcriptional regulator PrtR 



conserved hypothetical protein

probable transcriptional regulator

conserved hypothetical protein 

probable exopolysaccharide transporter

probable ATP-binding component of ABC transporter

hypothetical protein

probable short-chain dehydrogenase

conserved hypothetical protein

3-deoxy-manno-octulosonate cytidylyltransferase

probable transporter

conserved hypothetical protein

probable transcriptional regulator

probable thioesterase

hypothetical protein

hypothetical protein

hypothetical protein

cis-1,2-dihydroxycyclohexa-3,4-diene carboxylate dehydrogenase

probable enoyl-CoA hydratase/isomerase

molybdopterin biosynthesis MoeB protein

probable short-chain dehydrogenase

conserved hypothetical protein

heme exporter protein CcmC

hypothetical protein

tRNA (guanine-N1)-methyltransferase

type 4 fimbrial biogenesis protein PilF

imidazoleglycerol-phosphate synthase, cyclase subunit

conserved hypothetical protein

triosephosphate isomerase

uroporphyrinogen-III synthetase

undecaprenyl pyrophosphate synthetase

hypothetical protein

probable cobalamin biosynthetic protein

hypothetical protein

hypothetical protein

hypothetical protein

probable short-chain dehydrogenase

hypothetical protein

electron transfer flavoprotein beta-subunit

conserved hypothetical protein

hypothetical protein

hypothetical protein

hypothetical protein

hypothetical protein

hypothetical protein

3-oxoacyl-[acyl-carrier-protein] reductase

hypothetical protein

hypothetical protein

probable pili assembly chaperone

DNA polymerase III, epsilon chain

30S ribosomal protein S2

hypothetical protein

hypothetical protein

probable nucleoside phosphorylase

uridylate kinase

! h ypothetical protein — 

1 probable permease of ABC transporter . 

j hypothetical protein - 

j hypothetical protein . 

iconserved hypothetical protein ^ — 

1 hypothetical protein , — - 

j hypothetical protein • 

I hypothetical protein 

jtranscriptional regulator RhIR — _ __ \ 

probable phosphoribosyl transferase 

' probable ATP-binding component of ABC transporter : 

conserved hypothetical protein — ^ 

conserved hypothetical protein — — . 

hypothetical protein — — ■ ; 

probable biotin biosynthesis protein bioH _ 

probable permease of ABC transporter : ^ 

hypothetical protein ^ — H 

conserved hypothetical protein — — > 

ribonuclease PH ^jr^ ; ■ — 1 

probable glutamine amidotransferase ^ : ^ — — 1 

conserved hypothetical protein — 

probable transcriptional regulator — \ 

conserved hypothetical protein — j 

probable transcriptional regulator . — — ; 

DNA'SpecifIc endonuclease I — \ 

probable transcriptional regulator . __ — 1 

hypothetical protein ^ — . ___ — — j 

hypothetical protein . — — 1 

probable pill assembly chaperone i 

probable two-component response regulator — ! 

phosphoribosylaminoimidazole-succinocarboxamide synthase

succinate dehydrogenase (B subunit)

hypothetical protein

conserved hypothetical protein

hypothetical protein

conserved hypothetical protein

conserved hypothetical protein

conserved hypothetical protein

probable ATP-binding component of ABC transporter

heme exporter protein CcmA

hypothetical protein

hypothetical protein

probable transcriptional regulator

3-demethylubiquinone-9 3-methyltransferase

probable hydrolase

orotidine 5'-phosphate decarboxylase

hypothetical protein

hypothetical protein

50S ribosomal protein L1

conserved hypothetical protein

probable permease of ABC transporter
hypothetical protein

probable acetyltransferase

hypothetical protein

TolQ protein

hypothetical protein

conserved hypothetical protein

hypothetical protein

probable pseudouridylate synthase

cytidylate kinase

hypothetical protein

two-component response regulator

30S ribosomal protein S3

hypothetical protein

probable transcriptional regulator

dethiobiotin synthase

molybdenum transport protein ModB

transcriptional regulator Dnr

hypothetical protein

probable ATP-binding component of ABC transporter

leucyl/phenylalanyl-tRNA-protein transferase

conserved hypothetical protein

NADH dehydrogenase I chain B

probable two-component response regulator

hypothetical protein

probable transcriptional regulator

Na+-translocating NADH:ubiquinone oxidoreductase subunit Nqr4

ribonuclease T

hypothetical protein

conserved hypothetical protein

DNA repair protein RadC

ribulose-phosphate 3-epimerase

hypothetical protein

hypothetical protein

heme exporter protein CcmB

hypothetical protein

urease accessory protein UreF

ribose 5-phosphate isomerase

phosphoribosylaminoimidazole synthetase

probable transcriptional regulator

probable transcriptional regulator

probable transcriptional regulator

hypothetical protein

conserved hypothetical protein

probable acetyltransferase

probable transcriptional regulator

probable two-component response regulator

conserved hypothetical protein

2-keto-3-deoxy-6-phosphogluconate aldolase

probable permease of ABC transporter

hypothetical protein

hypothetical protein

probable transcriptional regulator
hypothetical protein
hypothetical protein



riboflavin synthase alpha chain



conserved hypothetical protein



thiopurine methyltransferase



probable permease of ABC transporter
alginate o-acetyltransferase AlgF

conserved hypothetical protein

conserved hypothetical protein

probable transcriptional regulator

pyridoxine 5'-phosphate oxidase



hypothetical protein



hypothetical protein



probable carboxylesterase 



pyoverdine biosynthesis protein PvcD



probable transcriptional regulator



probable carbonic anhydrase 



conserved hypothetical protein 



probable pyridoxamine 5'-phosphate oxidase



probable pyridoxamine 5'-phosphate oxidase



hypothetical protein 



probable glutathione S-transferase 



hypothetical protein



transcriptional regulator Vfr 



hypothetical protein



orotate phosphoribosyltransferase 



glutamine amidotransferase 



hypothetical protein



hypothetical protein



maleylacetoacetate isomerase 



probable radical activating enzyme 



hypothetical protein



uracil phosphoribosyltransferase



hypothetical protein



endonuclease 



probable transcriptional regulator 



probable antioxidant protein 



hypothetical protein 



conserved hypothetical protein



hypothetical protein 



probable tolQ-type transport protein 



pseudouridine synthase RluA 



conserved hypothetical protein 



50S ribosomal protein L3



conserved hypothetical protein 



hypothetical protein 



hypothetical protein 



probable two-component response regulator 



thymidylate kinase 



conserved hypothetical protein 



conserved hypothetical protein 



probable aromatic acid decarboxylase 






hypothetical protein

probable nuclease

periplasmic chaperone LolA

hypothetical protein

probable oxidoreductase

hypothetical protein

hypothetical protein

cell division protein FtsJ

hypothetical protein

hypothetical protein

hypothetical protein

30S ribosomal protein S4

homoserine kinase

hypothetical protein

probable lipoprotein localization protein LolB

stringent starvation protein A

hypothetical protein

GTP cyclohydrolase II

hypothetical protein

heme acquisition protein HasAp

probable transcriptional regulator

probable peptide chain release factor

probable ribosomal protein L25

hypothetical protein

hypothetical protein

hypothetical protein

hypothetical protein

hypothetical protein

conserved hypothetical protein

guanylate kinase

glutamine amidotransferase

conserved hypothetical protein

hypothetical protein

protocatechuate 3,4-dioxygenase, alpha subunit

hypothetical protein

autoinducer synthesis protein LasI

probable Clp-family ATP-dependent protease

hypothetical protein

50S ribosomal protein L4

conserved hypothetical protein

probable nitroreductase

hypothetical protein

hypothetical protein

hypothetical protein

conserved hypothetical protein

probable molybdopterin-guanine dinucleotide biosynthesis protein MobA

hypothetical protein

cytochrome c-type protein NapC

conserved hypothetical protein

hypothetical protein

hypothetical protein

probable phosphoheptose isomerase
probable type II secretion system protein
hypothetical protein
hypothetical protein



hypothetical protein 



hypothetical protein 



hypothetical protein



type 4 fimbrial biogenesis protein PilX



conserved hypothetical protein 



anti-sigma factor MucA

probable transcriptional regulator 



hypothetical protein 



conserved hypothetical protein 



conserved hypothetical protein 



hypothetical protein 



peptidyl-tRNA hydrolase



hypothetical protein



superoxide dismutase 



hypothetical protein



hypothetical protein 



conserved hypothetical protein 



hypothetical protein



hypothetical protein



probable DNA invertase 



probable acetyltransferase WbpD 



hypothetical protein



hypothetical protein 



hypothetical protein 



conserved hypothetical protein 



hypothetical protein 



conserved hypothetical protein 



ribosomal protein alanine acetyltransferase 



probable transcriptional regulator 



hypothetical protein 



conserved hypothetical protein



conserved hypothetical protein



translation elongation factor P 



conserved hypothetical protein 



hypothetical protein 



transcriptional regulator NfxB 



probable transcriptional regulator 



probable protease 



alkyl hydroperoxide reductase subunit C



peptidyl-prolyl cis-trans isomerase A 



hypothetical protein 



hypothetical protein 



hypothetical protein 



GTP cyclohydrolase I precursor 



probable transcriptional regulator 



heat shock protein GrpE 



CDP-diacylglycerol-glycerol-3-phosphate 3-phosphatidyltransferase

conserved hypothetical protein

conserved hypothetical protein
conserved hypothetical protein

hypothetical protein

hypothetical protein

hypothetical protein

probable transcriptional regulator

conserved hypothetical protein

hypothetical protein

translation initiation factor IF-3

potassium-transporting ATPase, C chain

probable transcriptional regulator 

conserved hypothetical protein 

conserved hypothetical protein 

adenine phosphoribosyltransferase 

conserved hypothetical protein 

dTDP-4-dehydrorhamnose 3,5-epimerase

hypothetical protein

probable sigma-70 factor, ECF subfamily

GTP cyclohydrolase I precursor

hypothetical protein 

cytochrome C biogenesis protein CcmG 
conserved hypothetical protein
50S ribosomal protein L5

hypothetical protein 

conserved hypothetical protein 
conserved hypothetical protein 
ATP synthase delta chain 

NosL protein 

conserved hypothetical protein 
conserved hypothetical protein 
conserved hypothetical protein 
50S ribosomal protein L6
conserved hypothetical protein 
conserved hypothetical protein 
transcription antitermination protein NusG 

hypothetical protein 

hypothetical protein 

conserved hypothetical protein 

lactoylglutathione lyase

outer membrane lipoprotein OmlA

hypothetical protein 

inorganic pyrophosphatase 

probable sigma-70 factor, ECF subfamily 

conserved hypothetical protein 

conserved hypothetical protein 

hypothetical protein 

16S rRNA processing protein 

hypothetical protein

hypothetical protein 

heme d1 biosynthesis protein NirL 

hypothetical protein

hypothetical protein 
general secretion pathway protein M

hypothetical protein

hypothetical protein

hypothetical protein

conserved hypothetical protein

conserved hypothetical protein

heat shock protein HscB

hypothetical protein 

hypothetical protein 

general secretion pathway protein H 

conserved hypothetical protein 

shikimate kinase 

hypothetical protein

conserved hypothetical protein

phosphatidylglycerophosphatase A

beta-hydroxydecanoyl-ACP dehydrase

conserved hypothetical protein

transcriptional regulator PyrR

conserved hypothetical protein

spermidine acetyltransferase 

prolipoprotein signal peptidase 

hypothetical protein 

conserved hypothetical protein 

probable transcriptional regulator 

conserved hypothetical protein 

probable sigma-70 factor, ECF subfamily

conserved hypothetical protein 

hypothetical protein 

hypothetical protein

probable outer membrane protein precursor 

probable outer membrane protein 

hypothetical protein 

transcription elongation factor GreB 

dihydrofolate reductase 

hypothetical protein 

hypothetical protein 

50S ribosomal protein L10

30S ribosomal protein S5 

hypothetical protein 

hypothetical protein 

hypothetical protein 

NADH dehydrogenase I chain J 

hypothetical protein 

hypothetical protein 

hypothetical protein 

single-stranded DNA-binding protein 

hypothetical protein

secretion protein SecB 

conserved hypothetical protein 

cytochrome c-type protein NapB precursor
probable phenazine biosynthesis protein
probable phenazine biosynthesis protein
probable phenazine biosynthesis protein



hypothetical protein 



hypothetical protein 



probable phenazine biosynthesis protein 



conserved hypothetical protein 



hypothetical protein 



toluate 1,2-dioxygenase beta subunit

2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase



peptidyl-prolyl cis-trans isomerase SlyD



hypothetical protein 



hypothetical protein 



hypothetical protein 



hypothetical protein 






molybdopterin biosynthetic protein C 



conserved hypothetical protein 



hypothetical protein 



conserved hypothetical protein 



hypothetical protein 



hypothetical protein 



NusB protein



phosphopantetheine adenylyltransferase 



hypothetical protein 



hypothetical protein



hypothetical protein 



hypothetical protein 
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